Preface



The book at hand on "New Approaches in Automation and Robotics" offers in 
22 chapters a collection of recent developments in automation, robotics as well as 
control theory. It is dedicated to researchers in science and industry, students, and 
practicing engineers, who wish to update and enhance their knowledge on modern 
methods and innovative applications. 

The authors and editor of this book wish to motivate people, especially undergraduate
students, to get involved with the interesting field of robotics and mechatronics.
We hope that the ideas and concepts presented in this book are useful for
your own work and can contribute to problem solving in similar applications as
well. It is clear, however, that the wide area of automation and robotics can only be
highlighted at several spots but not completely covered by a single book.

The editor would like to thank all the authors for their valuable contributions to 
this book. Special thanks to Editors in Chief of International Journal of Advanced 
Robotic Systems for their effort in making this book possible. 
A Model Reference Based 2-DOF Robust
Observer-Controller Design Methodology

Salva Alcantara, Carles Pedret and Ramon Vilanova 

Autonomous University of Barcelona 

Spain 



1. Introduction 

As is well known, standard feedback control is based on generating the control signal
$u$ by processing the error signal, $e = r - y$, that is, the difference between the reference
input and the actual output. Therefore, the input to the plant is

$$u = K(r - y) \tag{1}$$

It is well known that in such a scenario the design problem has one degree of freedom (1-DOF),
which may be described in terms of the stable Youla parameter (Vidyasagar, 1985).
The error signal in the 1-DOF case, see figure 1, is related to the external inputs $r$ and $d$ by
means of the sensitivity function $S = (1 + P_o K)^{-1}$, i.e., $e = S(r - d)$.






Fig. 1. Standard 1-DOF control system. 

Disregarding the sign, the reference r and the disturbance d have the same effect on the 
error e . Therefore, if r and d vary in a similar manner the controller K can be chosen to 
minimize e in some sense. Otherwise, if r and d have different nature, the controller has to 
be chosen to provide a good trade-off between the command tracking and the disturbance 
rejection responses. This compromise is inherent to the nature of 1-DOF control schemes. To 
allow independent controller adjustments for both r and d , additional controller blocks 
have to be introduced into the system as in figure 2. 
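
As a minimal numerical sketch of this trade-off, the snippet below evaluates the sensitivity function for a hypothetical first-order plant $P(s) = 1/(s+1)$ and a proportional controller $K = 10$. Both are illustrative assumptions, not values taken from this chapter. It shows that a unit reference and a unit output disturbance produce errors of identical magnitude at every frequency, so a single block $K$ cannot shape them independently.

```python
# Illustrative 1-DOF loop (hypothetical values): P(s) = 1/(s + 1), K = 10.

def plant(s: complex) -> complex:
    """Assumed example plant P(s) = 1 / (s + 1)."""
    return 1.0 / (s + 1.0)


def sensitivity(s: complex, k: float) -> complex:
    """Sensitivity S = 1 / (1 + P K), the map from (r - d) to the error e."""
    return 1.0 / (1.0 + plant(s) * k)


def error(s: complex, k: float, r: complex, d: complex) -> complex:
    """Error e = S (r - d) for components r and d at the complex frequency s."""
    return sensitivity(s, k) * (r - d)


if __name__ == "__main__":
    gain = 10.0
    for w in (0.1, 1.0, 10.0, 100.0):
        s = 1j * w
        e_from_r = abs(error(s, gain, 1.0, 0.0))
        e_from_d = abs(error(s, gain, 0.0, 1.0))
        print(f"w = {w:6.1f} rad/s  |e/r| = {e_from_r:.4f}  |e/d| = {e_from_d:.4f}")
```

Raising the gain lowers both columns together; there is no setting of `gain` that improves one response without changing the other.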

Two-degree-of-freedom (2-DOF) compensators are characterized by allowing a separate
processing of the reference inputs and the controlled outputs and may be described by
means of two stable Youla parameters. The 2-DOF compensators present the advantage of a
complete separation between feedback and reference tracking properties (Youla &
Bongiorno, 1985): the feedback properties of the controlled system are assured by a feedback
controller, i.e., the first degree of freedom; the reference tracking specifications are
addressed by a prefilter controller, i.e., the second degree of freedom, which determines the
open-loop processing of the reference commands. So, in the 2-DOF control configuration
shown in figure 2 the reference $r$ and the measurement $y$ enter the controller separately and
are independently processed, i.e.,

$$u = K_2 r - K_1 y \tag{2}$$

Fig. 2. Standard 2-DOF control configuration.



As it is pointed out in (Vilanova & Serra, 1997), classical control approaches tend to stress 
the use of feedback to modify the systems' response to commands. A clear example, widely 
used in the literature of linear control, is the usage of reference models to specify the desired 
properties of the overall controlled system (Astrom & Wittenmark, 1984). What is specified 
through a reference model is the desired closed-loop system response. Therefore, as the 
system response to a command is an open-loop property and robustness properties are 
associated with the feedback (Safonov et al., 1981), no stability margins are guaranteed 
when achieving the desired closed-loop response behaviour. 

A 2-DOF control configuration may be used in order to achieve a control system with both a 
performance specification, e.g., through a reference model, and some guaranteed stability 
margins. The approaches found in the literature are mainly based on optimization problems 
which basically represent different ways of setting the Youla parameters characterizing the 
controller (Vidyasagar, 1985), (Youla & Bongiorno, 1985), (Grimble, 1988), (Limebeer et al., 
1993). 

The approach presented in (Limebeer et al., 1993) expands the role of $\mathcal{H}_\infty$ optimization tools
in 2-DOF system design. The 1-DOF loop-shaping design procedure (McFarlane & Glover,
1992) is extended to a 2-DOF control configuration by means of a parameterization in terms
of two stable Youla parameters (Vidyasagar, 1985), (Youla & Bongiorno, 1985). A feedback
controller is designed to meet robust performance requirements in a manner similar to
the 1-DOF loop-shaping design procedure and a prefilter controller is then added to the
overall compensated system to force the response of the closed-loop to follow that of a
specified reference model. The approach is carried out by assuming uncertainty in the
normalized coprime factors of the plant (Glover & McFarlane, 1989). Such an uncertainty
description allows a formulation of the $\mathcal{H}_\infty$ robust stabilization problem providing explicit
formulae.

A frequency domain approach to model reference control with robustness considerations 
was presented in (Sun et al., 1994). The design approach consists of a nominal design part 
plus a modelling error compensation component to mitigate errors due to uncertainty. 
However, the approach inherits the restriction to minimum-phase plants from the Model
Reference Adaptive Control theory on which it is based.

In this chapter we present a 2-DOF control configuration based on a right coprime 
factorization of the plant. The presented approach, similar to that in (Pedret C. et al., 2005), 
is not based on setting the two Youla parameters arbitrarily, with internal stability being the 
only restriction. Instead, 

1. An observer-based feedback control scheme is designed to guarantee robust stability.
This is achieved by solving a constrained $\mathcal{H}_\infty$ optimization using the right
coprime factorization of the plant in an active way.

2. A prefilter controller is added to improve the open-loop processing of the robust
closed-loop. This is done by assuming a reference model capturing the desired input-output
relation and by solving a model matching problem for the prefilter controller to make
the overall system response resemble as much as possible that of the reference model.

The chapter is organized as follows: section 2 introduces the Observer-Controller
configuration used in this work within the framework of stabilizing control laws and the
Youla parameterization for the stabilizing controllers. Section 3 reviews the generalized
control framework and the concept of $\mathcal{H}_\infty$ optimization based control. Section 4 presents the
proposed 2-DOF control configuration and describes the two steps into which the associated
design is divided. In section 5 the suggested methodology is illustrated by a simple
example. Finally, section 6 closes the chapter by summarizing its content and drawing some
conclusions.

2. Stabilizing control laws and the Observer-Controller configuration 

This section introduces the reader to the celebrated Youla parameterization,
mentioned throughout the introduction. This result gives all the control laws that attain
closed-loop stability in terms of two stable but otherwise free parameters. In order to do so,
first a basic review of the factorization framework is given and then the Observer-Controller
configuration used in this chapter is presented within the aforementioned framework. The
Observer-Controller configuration constitutes the basis for the control structure presented in
this work.

2.1 The factorization framework 

A short introduction to the so-called factorization or fractional approach is provided in this 
section. The central idea is to factor a transfer function of a system, not necessarily stable, as 
a ratio of two stable transfer functions. The factorization framework will constitute the 
foundations for the analysis and design in subsequent sections. The treatment in this section 
is fairly standard and follows (Vilanova, 1996), (Vidyasagar, 1985) or (Francis, 1987). 

2.1.2 Coprime factorizations over $\mathcal{RH}_\infty$

A usual way of representing a scalar system is as a rational transfer function of the form

$$P_o(s) = \frac{n(s)}{m(s)} \tag{3}$$

where $n(s)$ and $m(s)$ are polynomials and (3) is called the polynomial fraction representation
of $P_o(s)$. Another way of representing $P_o(s)$ is as the product of a stable transfer function
and a transfer function with stable inverse, i.e.,

$$P_o(s) = N(s)M^{-1}(s) \tag{4}$$

where $N(s), M(s) \in \mathcal{RH}_\infty$, the set of stable and proper transfer functions.
In the Single-Input Single-Output (SISO) case, it is easy to get such a fractional representation
from the polynomial form (3). Let $\delta(s)$ be a Hurwitz polynomial such that
$\deg \delta(s) = \deg m(s)$ and set

$$N(s) = \frac{n(s)}{\delta(s)}, \qquad M(s) = \frac{m(s)}{\delta(s)} \tag{5}$$
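
As a short worked instance of (5), consider a hypothetical unstable plant $P_o(s) = 1/(s-1)$ (chosen only for illustration) and take the Hurwitz polynomial of (5), written $\delta(s)$ here, as $s + 1$:

```latex
n(s) = 1,\quad m(s) = s - 1,\quad \delta(s) = s + 1
\;\Longrightarrow\;
N(s) = \frac{1}{s+1},\qquad M(s) = \frac{s-1}{s+1},\qquad
N(s)M^{-1}(s) = \frac{1}{s+1}\cdot\frac{s+1}{s-1} = \frac{1}{s-1} = P_o(s).
```

Both factors are stable and proper even though $P_o$ itself is unstable; the unstable pole of the plant shows up as a right half plane zero of $M(s)$.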

The factorizations to be used will be of a special type called Coprime Factorizations. Two
polynomials $n(s)$ and $m(s)$ are said to be coprime if their greatest common divisor is 1 (no
common zeros). It follows from Euclid's algorithm - see for example (Kailath, 1980) - that
$n(s)$ and $m(s)$ are coprime iff there exist polynomials $x(s)$ and $y(s)$ such that the
following identity is satisfied:

$$x(s)m(s) + y(s)n(s) = 1 \tag{6}$$

Note that if $z$ is a common zero of $n(s)$ and $m(s)$ then $x(z)m(z) + y(z)n(z) = 0$ for any
$x(s)$ and $y(s)$, so (6) cannot hold and therefore $n(s)$ and $m(s)$ are not coprime. This concept
can be readily generalized to transfer functions $N(s), M(s), X(s), Y(s)$ in $\mathcal{RH}_\infty$. Two transfer
functions $M(s), N(s)$ in $\mathcal{RH}_\infty$ are coprime when they do not share zeros in the right half
plane. Then it is always possible to find $X(s), Y(s)$ in $\mathcal{RH}_\infty$ such that
$X(s)M(s) + Y(s)N(s) = 1$.
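
The polynomial identity (6) can be computed with the extended Euclidean algorithm. The following is a minimal sketch in plain Python using exact rational arithmetic; the helper names (`poly`, `bezout`, and so on) and the example polynomials $m(s) = s^2 - 1$, $n(s) = s + 2$ are assumptions made for illustration only.

```python
from fractions import Fraction

# Polynomials are coefficient lists, lowest degree first: s - 1 -> [-1, 1].


def trim(p):
    """Drop trailing zero coefficients, keeping at least one entry."""
    p = list(p)
    while len(p) > 1 and p[-1] == 0:
        p.pop()
    return p


def poly(*coeffs):
    """Build a polynomial with exact rational coefficients."""
    return trim([Fraction(c) for c in coeffs])


def add(p, q):
    size = max(len(p), len(q))
    p = list(p) + [Fraction(0)] * (size - len(p))
    q = list(q) + [Fraction(0)] * (size - len(q))
    return trim([a + b for a, b in zip(p, q)])


def mul(p, q):
    out = [Fraction(0)] * (len(p) + len(q) - 1)
    for i, a in enumerate(p):
        for j, b in enumerate(q):
            out[i + j] += a * b
    return trim(out)


def divmod_poly(p, q):
    """Long division: p = quo * q + rem with deg(rem) < deg(q)."""
    p, q = trim(p), trim(q)
    quo = [Fraction(0)] * max(len(p) - len(q) + 1, 1)
    rem = p
    while len(rem) >= len(q) and rem != [0]:
        shift = len(rem) - len(q)
        coef = rem[-1] / q[-1]
        quo[shift] = coef
        # Subtract coef * s^shift * q; the leading term cancels exactly.
        rem = add(rem, [Fraction(0)] * shift + [-coef * c for c in q])
    return trim(quo), rem


def bezout(m, n):
    """Return (x, y, g) with x*m + y*n = g, where g is the monic gcd of m and n."""
    r0, r1 = trim(m), trim(n)
    x0, x1 = [Fraction(1)], [Fraction(0)]
    y0, y1 = [Fraction(0)], [Fraction(1)]
    while r1 != [0]:
        q, r = divmod_poly(r0, r1)
        neg_q = [-c for c in q]
        r0, r1 = r1, r
        x0, x1 = x1, add(x0, mul(neg_q, x1))
        y0, y1 = y1, add(y0, mul(neg_q, y1))
    lead = r0[-1]
    return [c / lead for c in x0], [c / lead for c in y0], [c / lead for c in r0]


if __name__ == "__main__":
    m = poly(-1, 0, 1)   # m(s) = s^2 - 1
    n = poly(2, 1)       # n(s) = s + 2
    x, y, g = bezout(m, n)
    print("x =", x, " y =", y, " gcd =", g)
```

For this pair the gcd is 1, so $m$ and $n$ are coprime. Replacing $n(s)$ by $s - 1$ makes the gcd equal to $s - 1$, which reflects the shared zero at $s = 1$.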

When moving to the multivariable case, we also have to distinguish between right and left 
coprime factorizations since we lose the commutative property present in the SISO case. 
The following definitions tackle directly the multivariable case. 

Definition 1. (Bezout Identity) Two stable matrix transfer functions $N_r$ and $M_r$ are right
coprime if and only if there exist stable matrix transfer functions $X_r$ and $Y_r$ such that

$$\begin{bmatrix} X_r & Y_r \end{bmatrix} \begin{bmatrix} M_r \\ N_r \end{bmatrix} = X_r M_r + Y_r N_r = I \tag{7}$$

Similarly, two stable matrix transfer functions $N_l$ and $M_l$ are left coprime if and only if
there exist stable matrix transfer functions $X_l$ and $Y_l$ such that

$$\begin{bmatrix} M_l & N_l \end{bmatrix} \begin{bmatrix} X_l \\ Y_l \end{bmatrix} = M_l X_l + N_l Y_l = I \tag{8}$$

The matrix transfer functions $X_r, Y_r$ ($X_l, Y_l$) belonging to $\mathcal{RH}_\infty$ are called right (left) Bezout
complements.

Now let $P_o(s)$ be a proper real rational transfer function. Then,

Definition 2. A right (left) coprime factorization, abbreviated RCF (LCF), is a factorization
$P_o(s) = N_r M_r^{-1}$ ($P_o(s) = M_l^{-1} N_l$), where $N_r, M_r$ ($N_l, M_l$) are right (left) coprime over
$\mathcal{RH}_\infty$.

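With the above definitions, the following theorem arises to provide right and left coprime
factorizations of a system given in terms of a state-space realization. Let us suppose that

$$P_o(s) = \left[\begin{array}{c|c} A & B \\ \hline C & D \end{array}\right] \tag{9}$$

is a minimal stabilisable and detectable state-space realization of the system $P_o(s)$.

Theorem 1. Define

$$\begin{bmatrix} M_r & -Y_l \\ N_r & X_l \end{bmatrix} = \left[\begin{array}{c|cc} A + BF & B & -L \\ \hline F & I & 0 \\ C + DF & D & I \end{array}\right], \qquad \begin{bmatrix} X_r & Y_r \\ -N_l & M_l \end{bmatrix} = \left[\begin{array}{c|cc} A + LC & -(B + LD) & L \\ \hline F & I & 0 \\ C & -D & I \end{array}\right] \tag{10}$$

where $F$ and $L$ are such that $A + BF$ and $A + LC$ are stable. Then, $P_o(s) = N_r(s)M_r^{-1}(s)$
($P_o(s) = M_l^{-1}(s)N_l(s)$) is a RCF (LCF).

Proof. The theorem is demonstrated by substituting (10) into equation (7).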

Standard software packages can be used to compute appropriate $F$ and $L$ matrices
numerically so that the eigenvalues of $A + BF$ are those in the vector

$$P_K = [p_{K_1}, \ldots, p_{K_n}] \tag{11}$$

Similarly, the eigenvalues of $A + LC$ can be allocated in accordance with the vector

$$P_L = [p_{L_1}, \ldots, p_{L_n}] \tag{12}$$
By performing this pole placement, we are implicitly making active use of the degrees of
freedom available for building coprime factorizations. Our final design of section 4 will 
make use of this available freedom for trying to meet all the controller specifications. 
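
To make the role of $F$ and $L$ concrete, here is a minimal first-order sketch. It assumes the standard state-space formulas for a doubly coprime factorization with the sign conventions commonly found in robust control textbooks: $M_r = (A+BF, B, F, I)$, $N_r = (A+BF, B, C+DF, D)$, $X_r = (A+LC, -(B+LD), F, I)$ and $Y_r = (A+LC, L, F, 0)$. The plant $P_o(s) = 1/(s-1)$ and the gains are hypothetical values chosen for illustration. The code evaluates the factors at a complex frequency and checks both $N_r M_r^{-1} = P_o$ and the Bezout identity.

```python
# Scalar (n = 1) instance of the state-space coprime factor formulas.
# Hypothetical plant: P_o(s) = 1/(s - 1), realized by A = 1, B = 1, C = 1, D = 0.

A, B, C, D = 1.0, 1.0, 1.0, 0.0
F = -3.0   # state-feedback gain: A + B*F = -2 (stable)
L = -4.0   # observer gain:       A + L*C = -3 (stable)


def tf(a: float, b: float, c: float, d: float, s: complex) -> complex:
    """Transfer function d + c (s - a)^{-1} b of a first-order realization."""
    return d + c * b / (s - a)


def factors(s: complex):
    """Right coprime factors and right Bezout complements evaluated at s."""
    m_r = tf(A + B * F, B, F, 1.0, s)
    n_r = tf(A + B * F, B, C + D * F, D, s)
    x_r = tf(A + L * C, -(B + L * D), F, 1.0, s)
    y_r = tf(A + L * C, L, F, 0.0, s)
    return m_r, n_r, x_r, y_r


if __name__ == "__main__":
    s = 2j
    m_r, n_r, x_r, y_r = factors(s)
    print("N_r / M_r      =", n_r / m_r, " plant =", tf(A, B, C, D, s))
    print("X_r M_r + Y_r N_r =", x_r * m_r + y_r * n_r)
```

Changing `F` or `L` moves the poles of the factors ($-2$ and $-3$ here) without changing the plant they represent, which is the freedom referred to in the text.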

2.2 The Youla parameterization and the Observer-Controller configuration 

A control law is said to be stabilizing if it provides internal stability to the overall
closed-loop system, which means that we have Bounded-Input-Bounded-Output (BIBO) stability
between every input-output pair of the resulting closed-loop arrangement. For instance, if
we consider the general control law $u = K_2 r - K_1 y$ in figure 3a, internal stability amounts to
all the entries in the mapping $(r, d_i, d_o) \to (u, y)$ being stable.

Let us reconsider the standard 1-DOF control law of figure 1 in which $u = K(r - y)$. For
this particular case, the following theorem gives a parameterization of all the stabilizing
control laws.

Theorem 2. (1-DOF Youla parameterization) For a given plant $P_o = N_r M_r^{-1}$, let
$\mathcal{C}_{1DOF}(P_o)$ denote the set of stabilizing 1-DOF controllers $K_1$, that is,

$$\mathcal{C}_{1DOF}(P_o) = \{K_1 : \text{the control law } u = K_1(r - y) \text{ is stabilizing}\} \tag{13}$$

The set $\mathcal{C}_{1DOF}(P_o)$ can be parameterized by

$$\mathcal{C}_{1DOF}(P_o) = \left\{ \frac{Y_r + M_r Q_y}{X_r - N_r Q_y} : Q_y \in \mathcal{RH}_\infty \right\} \tag{14}$$
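
The following sketch illustrates the parameterization numerically for a SISO case. It assumes the Bezout identity in the form $X_r M_r + Y_r N_r = 1$, so that the controllers are written $K = (Y_r + M_r Q)/(X_r - N_r Q)$. The plant $P_o(s) = 1/(s-1)$, its factors and the Youla parameter $Q(s) = q_0/(s+1)$ are hypothetical choices. The check confirms that the closed-loop map $P_o K/(1 + P_o K)$ equals $N_r (Y_r + M_r Q)$, which is affine in $Q$ and stable for every stable $Q$.

```python
# Hypothetical SISO example: P_o(s) = 1/(s - 1) = N/M with
#   M = (s - 1)/(s + 2),  N = 1/(s + 2),
#   X = (s + 6)/(s + 3),  Y = 12/(s + 3),  so that X*M + Y*N = 1.


def M(s):
    return (s - 1) / (s + 2)


def N(s):
    return 1 / (s + 2)


def X(s):
    return (s + 6) / (s + 3)


def Y(s):
    return 12 / (s + 3)


def Q(s, q0=2.0):
    """An arbitrary stable Youla parameter (illustrative choice)."""
    return q0 / (s + 1)


def controller(s, q0=2.0):
    """K = (Y + M Q) / (X - N Q)."""
    return (Y(s) + M(s) * Q(s, q0)) / (X(s) - N(s) * Q(s, q0))


def closed_loop_direct(s, q0=2.0):
    """T = P K / (1 + P K), computed directly from the loop."""
    pk = (N(s) / M(s)) * controller(s, q0)
    return pk / (1 + pk)


def closed_loop_affine(s, q0=2.0):
    """The same map written as N (Y + M Q)."""
    return N(s) * (Y(s) + M(s) * Q(s, q0))


if __name__ == "__main__":
    for s in (0.5j, 2j, 1 + 3j):
        print(s, closed_loop_direct(s), closed_loop_affine(s))
```

With `q0 = 0` the controller reduces to $Y_r/X_r = 12/(s+6)$, which places the closed-loop poles at $-2$ and $-3$, the poles chosen for the factors.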

As was pointed out in the introduction of this chapter, the standard feedback control
configuration of figure 1 lacks the possibility of offering independent processing of
disturbance rejection and reference tracking. So, the controller has to be designed to
provide closed-loop stability and a good trade-off between the conflicting performance
objectives. To achieve this independence of open-loop and closed-loop properties, we
added the extra block $K_2$ (the prefilter) to figure 1, leading to the standard 2-DOF control
scheme in figure 2. Now the control law is of the form

$$u = K_2 r - K_1 y \tag{15}$$

where $K_1$ and $K_2$ are to be chosen to provide closed-loop stability and meet the performance
specifications. This control law is the most general stabilizing linear time invariant control
law since it includes all the external inputs ($y$ and $r$) in $u$.

Because two compensator blocks are needed for expressing $u$ according to
(15), 2-DOF compensators are also referred to as two-parameter compensators. It is worth
emphasizing that (15) represents the most general feedback compensation scheme and that,
for example, there is no three-parameter compensator.
Fig. 3. (a) 2-DOF control diagram. (b) An unfeasible implementation of the 2-DOF control
law $u = K_2 r - K_1 y$. (c) A feasible implementation of the control law $u = K_2 r - K_1 y$.

It is evident that if we make $K_1 = K_2 = K$, then we have $u = K(r - y)$ and recover the
standard 1-DOF feedback configuration (one-parameter compensator) of figure 1. Once we
have designed $K_1$ and $K_2$, equation (15) simply gives a control law but it says nothing about
its actual implementation, see (Wilfred, W.K. et al., 2007). For instance, in figure 3b we
can see one possible implementation of the control law given by (15) which is a direct
translation of the equation into a block diagram. It should be noted that this implementation
is not valid when $K_2$ is unstable, since this block acts in an open-loop fashion and this
would result in an unstable overall system, in spite of the control law being a stabilizing
one. To circumvent this problem we can make use of the previously presented factorization
framework and proceed as follows: define $C = [K_1 \; K_2]$ and let $K_2 = M_{l,C}^{-1} N_{l,K_2}$ and
$K_1 = M_{l,C}^{-1} N_{l,K_1}$, such that $(M_{l,C}, [N_{l,K_1} \; N_{l,K_2}])$ is a LCF of $C$. Once $C = [K_1 \; K_2]$ has
been factorized as suggested, the control action in (15) can be implemented as shown in
figure 3c. In this figure the plant has been right-factored as $N_r M_r^{-1}$. It can be shown that the
mapping $(r, d_i, d_o) \to (z_1, z_2, u, y)$ remains stable (necessary for internal stability) if and
only if so does the mapping $(r, d_i, d_o) \to (u, y)$. The following theorem states when the
system depicted in figure 3c is internally stable.

Theorem 3. The system of figure 3c is internally stable if and only if

$$R^{-1} := M_{l,C} M_r + N_{l,K_1} N_r \in \mathcal{RH}_\infty, \qquad R \in \mathcal{RH}_\infty \tag{16}$$

We can now proceed to state the 2-DOF Youla parameterization.

Theorem 4. (2-DOF Youla parameterization) For a given plant $P_o = N_r M_r^{-1}$, let $\mathcal{C}_{2DOF}(P_o)$
denote the set of stabilizing 2-DOF controllers $C = [K_1 \; K_2]$, that is,

$$\mathcal{C}_{2DOF}(P_o) = \{C = [K_1 \; K_2] : \text{the control law } u = K_2 r - K_1 y \text{ is stabilizing}\} \tag{17}$$

The set $\mathcal{C}_{2DOF}(P_o)$ can be parameterized as follows

$$\mathcal{C}_{2DOF}(P_o) = \left\{ C = \left[ \frac{Y_r + Q_y M_r}{X_r - Q_y N_r} \quad \frac{Q_r}{X_r - Q_y N_r} \right] : Q_y, Q_r \in \mathcal{RH}_\infty \right\} \tag{18}$$

Proof. Based on theorem 2, it follows that the transfer function $R$ will satisfy theorem 3 if
and only if $M_{l,C}^{-1} N_{l,K_1}$ equals $(X_r - Q_y N_r)^{-1}(Y_r + Q_y M_r)$ for some $Q_y$ in $\mathcal{RH}_\infty$ such that
$|X_r - Q_y N_r| \neq 0$. Moreover, $R$ is independent of $N_{l,K_2}$. This leads at once to (18).
Continuing with figure 3c, let us assume that we take

$$N_{l,K_2} = 1, \qquad N_{l,K_1} = K_1 Y_r, \qquad M_{l,C} = 1 + K_1 X_r \tag{19}$$

where $K_1 \in \mathcal{RH}_\infty$. Then the two-parameter compensator can be redrawn as shown in figure
4a. For reasons that will become clear later on, this particular two-parameter compensator is
referred to as the Observer-Controller scheme.

Fig. 4. (a) Observer-Controller in two blocks form. (b) Observer-Controller in three blocks
form, where $P_o = N_r M_r^{-1}$ is a RCF.

Applying theorem 3 to the particular case at hand, the stability condition for the system of
figure 4a reduces to

$$R^{-1} = (1 + K_1 X_r)M_r + K_1 Y_r N_r = M_r + K_1 \in \mathcal{RH}_\infty, \qquad R \in \mathcal{RH}_\infty \tag{20}$$

where the Bezout identity (7) has been used. It can be verified that the relation between $r$
and $y$ is given by $T_{yr} = N_r R$. For $T_{yr}$ to be stable, we have to require $R$ to be stable. On the
other hand, $R^{-1}$ is given by $M_r + K_1$, which is stable having chosen $K_1$ stable. Choosing such
an $R$ for our design, the stability requirements for the overall system to be internally stable
are satisfied.

It is easy to see that figure 4a can be rearranged as in figure 4b, where the plant appears in
right-factored form ($P_o = N_r M_r^{-1}$). Now it is straightforward to notice that the relation
between $\hat{x}$ and $x$ is given by

$$\hat{x} = (X_r M_r + Y_r N_r)x = x \tag{21}$$

where the Bezout identity applies. This way, the $X_r$ and $Y_r$ blocks can be thought of as an
observer for the fictitious signal $x$ appearing in the middle of the RCF. So, feeding back the
observation of $x$ allows the closed-loop eigenvalues to be placed at prescribed locations since the
achieved input to output relation is given by $y = N_r R r$ and the stable poles of both $N_r$
and $R$ are freely assignable. This is reminiscent of a basic result from state-space
control theory associated with observed state feedback: assuming a minimal realization of
the plant, state feedback using observers lets you change the dynamics of the plant by
moving the closed-loop poles of the resulting control system to desired positions in the left
half plane. Let us assume the following situation for figure 4b

$$P_o = \frac{b}{a}, \quad M_r = \frac{a}{p_K}, \quad N_r = \frac{b}{p_K}, \quad X_r = \frac{n_x}{p_L}, \quad Y_r = \frac{n_y}{p_L} \tag{22}$$

Now let us take $K_1$ to be of the form

$$K_1 = \frac{m}{p_K} \tag{23}$$

where $m$ is an arbitrary polynomial in $s$ of degree $n-1$. With $p_K$ and $p_L$ we refer here to monic
polynomials in $s$ having as roots the entries of the vectors in (11) and (12), respectively. The
dependence on $s$ has been dropped to simplify the notation. By choosing this stable $K_1$ the
relation between the input $r$ and the output $y$ becomes

$$T_{yr} = N_r R = \frac{b}{a + m} \tag{24}$$

So we have achieved a reallocation of the closed-loop poles leaving the zeros of the plant
unaltered, as happens in the context of state-space theory when one makes use of
observed state feedback.
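
Since $a$ is monic of degree $n$ and $m$ has degree $n-1$, the closed-loop denominator $a + m$ can be made equal to any monic polynomial of degree $n$. The sketch below picks $m$ for a set of desired closed-loop poles. The function names and the example data ($a(s) = s^2 - 1$, desired poles at $-3$ and $-4$) are illustrative assumptions, not values from this chapter.

```python
# Polynomials as coefficient lists, highest degree first: s^2 + 3s + 2 -> [1, 3, 2].


def poly_from_roots(roots):
    """Monic polynomial with the given real roots."""
    coeffs = [1.0]
    for r in roots:
        # Multiply the current polynomial by (s - r).
        coeffs = [hi - r * lo for hi, lo in zip(coeffs + [0.0], [0.0] + coeffs)]
    return coeffs


def design_m(a, desired_roots):
    """Choose m (degree n-1) so that a + m has the desired roots."""
    target = poly_from_roots(desired_roots)
    if len(target) != len(a) or a[0] != 1.0:
        raise ValueError("a must be monic and match the number of desired roots")
    # The leading coefficients cancel, leaving a polynomial of degree n-1.
    return [t - c for t, c in zip(target, a)][1:]


if __name__ == "__main__":
    a = [1.0, 0.0, -1.0]                 # a(s) = s^2 - 1 (one unstable pole)
    m = design_m(a, [-3.0, -4.0])        # want a + m = (s + 3)(s + 4)
    print("m(s) coefficients:", m)       # coefficients of s^1 and s^0
```

The numerator $b$ never enters the computation, which mirrors the remark that the plant zeros are left unaltered.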

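What follows is intended to clarify the relationship between the scheme of figure
4 and conventional state-feedback controllers. For this purpose, we recall here results
appearing in (Kailath, 1980), among others. Let us assume that the system input-output
relation is given in the form

$$y = \frac{b}{a}u \tag{25}$$

One can now replace equation (25) by the following two

$$x_p = \frac{1}{a}u, \qquad y = b\,x_p \tag{26}$$

and choose the following state variables for describing the system in the state space

$$x_1 = x_p, \quad x_2 = \dot{x}_p, \quad \ldots, \quad x_n = x_p^{(n-1)} \tag{27}$$

This leads to the well-known controllable canonical form realization. Writing
$a(s) = s^n + a_1 s^{n-1} + \cdots + a_n$ and $b(s) = b_0 s^n + b_1 s^{n-1} + \cdots + b_n$,

$$\frac{d}{dt}\begin{bmatrix} x_p \\ \dot{x}_p \\ \vdots \\ x_p^{(n-2)} \\ x_p^{(n-1)} \end{bmatrix} = \begin{bmatrix} 0 & 1 & 0 & \cdots & 0 \\ 0 & 0 & 1 & \cdots & 0 \\ \vdots & & & \ddots & \vdots \\ 0 & 0 & 0 & \cdots & 1 \\ -a_n & -a_{n-1} & -a_{n-2} & \cdots & -a_1 \end{bmatrix} \begin{bmatrix} x_p \\ \dot{x}_p \\ \vdots \\ x_p^{(n-2)} \\ x_p^{(n-1)} \end{bmatrix} + \begin{bmatrix} 0 \\ 0 \\ \vdots \\ 0 \\ 1 \end{bmatrix} u$$

$$y = \begin{bmatrix} b_n - a_n b_0 & \cdots & b_1 - a_1 b_0 \end{bmatrix} \begin{bmatrix} x_p \\ \vdots \\ x_p^{(n-1)} \end{bmatrix} + b_0 u \tag{28}$$

The corresponding realization is shown in figure 5a.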
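
As a sketch of how such a realization can be built and sanity-checked, the code below constructs a controllable canonical form for $b(s)/a(s)$. It assumes the state ordering $x = [x_p, \dot{x}_p, \ldots, x_p^{(n-1)}]$ and the coefficient conventions $a(s) = s^n + a_1 s^{n-1} + \cdots + a_n$, $b(s) = b_0 s^n + \cdots + b_n$; other orderings are equally common, and the function names are hypothetical. The check works in the Laplace domain: for $u = 1$ the state must be $x_k = s^k / a(s)$.

```python
def companion(a, b):
    """Controllable canonical form (A, B, C, D) for b(s)/a(s).

    a = [a_1, ..., a_n]      from a(s) = s^n + a_1 s^(n-1) + ... + a_n
    b = [b_0, b_1, ..., b_n] from b(s) = b_0 s^n + b_1 s^(n-1) + ... + b_n
    """
    n = len(a)
    # Chain of integrators: d/dt x_i = x_{i+1} for the first n-1 states.
    A = [[1.0 if j == i + 1 else 0.0 for j in range(n)] for i in range(n - 1)]
    A.append([-a[n - 1 - j] for j in range(n)])   # last row: -a_n ... -a_1
    B = [0.0] * (n - 1) + [1.0]
    C = [b[n - j] - a[n - 1 - j] * b[0] for j in range(n)]
    D = b[0]
    return A, B, C, D


def check(a, b, s):
    """Return (state-equation residual, output error) at complex frequency s."""
    n = len(a)
    a_s = s ** n + sum(a[k] * s ** (n - 1 - k) for k in range(n))
    b_s = sum(b[k] * s ** (n - k) for k in range(n + 1))
    A, B, C, D = companion(a, b)
    x = [s ** k / a_s for k in range(n)]          # state for u = 1
    residual = max(
        abs(s * x[i] - sum(A[i][j] * x[j] for j in range(n)) - B[i])
        for i in range(n)
    )
    y = sum(C[j] * x[j] for j in range(n)) + D
    return residual, abs(y - b_s / a_s)


if __name__ == "__main__":
    # Example: (s + 5) / (s^2 + 3 s + 2)
    print(check([3.0, 2.0], [0.0, 1.0, 5.0], 1 + 2j))
```

Both returned numbers should be at rounding level, confirming that the state is $x_p$ and its first $n-1$ derivatives and that the output equals $b(s)/a(s)$ times the input.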

The point is that the fictitious signal $x_p$ can be used to determine the complete state (in the
controllable canonical form realization) of the system by just differentiating it $n-1$ times. Now
suppose that $z$ and $w$ are polynomials such that

$$za + wb = 1 \tag{29}$$
Fig. 5. (a) Controllable canonical form realization of $b/a$. (b) Unfeasible observer-based
state-feedback scheme. (c) Towards a feasible observer-controller: part I. (d) Part II.

In figure 5b we can see a way of thinking of a state-feedback controller. Through $z$ and
$w$ we observe $x_p$ and by multiplying it by $m$ we achieve an arbitrary linear combination of
$x_p$ and its derivatives, that is, a state feedback control law. Obviously, the scheme as such
cannot be implemented. But it is easy to make it realizable by introducing an nth-order
polynomial $d$ (the so-called observer polynomial) as in figure 5c; then the numerators of
$zm/d$ and $wm/d$ can be made of degree equal to or less than $n$ - see (Kailath, 1980) - without
altering the state feedback gain, leading to $n_x$ and $n_y$ in figure 5d. So figure 5 summarises a
procedure entirely based on the transfer function domain (although at the level of polynomials) to
implement a state-feedback control law. However, the scheme in figure 5d is not exactly the
one we will work with.

By introducing another nth-degree stable polynomial ($a_o$), figure 5c can be redrawn as in
figure 6a.

By doing this we are considering that our plant is the series connection of two systems, that
is, $P_o = P_1 P_2$, where $P_1 = a_o/a$ and $P_2 = b/a_o$. So we are considering on purpose a non-minimal
realization of the plant. The series connection system is not completely controllable but it is
completely observable. Let us denote by $A, B, C, D$ the corresponding realization matrices of
$P_o$ in terms of the realization matrices of $P_1$ ($A_1, B_1, C_1, D_1$) and $P_2$ ($A_2, B_2, C_2, D_2$) in
controllable canonical form. Then we arrive at the non-minimal realization

$$\left[\begin{array}{c|c} A & B \\ \hline C & D \end{array}\right] = \left[\begin{array}{cc|c} A_1 & 0 & B_1 \\ B_2 C_1 & A_2 & B_2 D_1 \\ \hline D_2 C_1 & C_2 & D_2 D_1 \end{array}\right] \tag{30}$$

where the state vector for $P_o$ is of the form $x = [x_1^T \; x_2^T]^T$, $x_1$ and $x_2$ being the state vectors
of $P_1$ and $P_2$, respectively, in controllable canonical form. In more detail, the state matrix
$A$ of $P_o$ is given by
Fig. 6. (a) Non-realizable Observer-Controller configuration. (b) Realizable
Observer-Controller configuration. (c) Realizable observer-controller put in the form of a standard
observed state feedback (i.e., figure 5d).

Now it is straightforward to see that 



j A 2 



TAT' 



(32) 
where
(33)



By using this similarity state transformation the new realization matrices are given by 
(34)



where $c_i \neq 0 \;\; \forall i = 1, \ldots, 2n$. From (34) it is evident that the controllable states are the first $n$
states, which are obtainable through $x_p$ and its $n-1$ successive derivatives. Besides, it is easy
to see that the similarity transformation employed does not alter the first $n$ state
components. The approach taken in this work consists of observing the $2n$ states of the
non-minimal realization (34) and considering just the first $n$ states corresponding to the
controllable part (this partial state vector of dimension $n$ is equal to the state of $b(s)/a(s)$ in
controllable canonical form) for state feedback. By doing this, see figure 6b, we are
introducing $n$ extra degrees of freedom (the $n$ roots of the Hurwitz polynomial $p_K$) into the
design. In figures 6b and 6c we have returned to the terminology of section 2.1, where we
introduced the coprime factorizations over $\mathcal{RH}_\infty$; with respect to figure 6a the following
identities hold: $p_K = a_o$, $p_L = d$. The term Observer-Controller is used in this work to
refer to an observer-based state feedback control system designed following this
approach. The method presented in section 4 uses the extra freedom which arises from
using a non-minimal order observer (see figure 6c, where the observer polynomial $p_K p_L$ has
degree $2n$, $n$ being the order of the plant) to try to meet more demanding objectives.



3. $\mathcal{H}_\infty$-norm optimization based robust control systems design

In this section we review the general method of formulating control problems introduced by
(Doyle, 1983). Within this framework, we recall the general method for representing
uncertainty for multivariable systems and determine the condition for robust stability in the
presence of unstructured additive uncertainty. The presentation is fairly standard; we refer
the reader to (Skogestad S., 1997) for a more detailed treatment.

3.1 General control problem formulation

Within the general control configuration (Doyle, 1983) of figure 7, $G$ is referred to as the
generalized plant and $K$ is the generalized controller. Four types of external variables are
dealt with: exogenous inputs, $w$, i.e., commands, disturbances and noise; exogenous
outputs, $z$, e.g., error signals to be minimized; controller inputs, $v$, e.g., commands,
measured plant outputs, measured disturbances; and control signals, $u$.






Fig. 7. Generalized plant and controller. 

The controller design problem is divided into the analysis and the synthesis phases. The
controller $K$ is synthesized such that some measure, mathematically a norm, of the transfer
function from $w$ to $z$ is minimized, e.g. the $\mathcal{H}_\infty$ norm.

Definition 3 ($\mathcal{H}_\infty$-norm) The $\mathcal{H}_\infty$-norm of a proper stable system $P$ is given by

$$\|P(s)\|_\infty = \sup_\omega \bar{\sigma}(P(j\omega)) \tag{35}$$

where $\bar{\sigma}(P)$ denotes the largest singular value of the matrix $P$.

In words, the $\mathcal{H}_\infty$-norm of a dynamic system is the maximum amplification the system can
make to the energy of the input signal in any direction. In the SISO case it is equal to the
maximum value of the system's frequency response magnitude (the magnitude peak in the
Bode diagram). For the general MIMO case it is equal to the system's largest singular value
over all the frequencies. From this point on, every mention of a norm
implicitly refers to the $\mathcal{H}_\infty$ norm defined above, and no further remarks will be made.
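
For a SISO system the definition can be approximated by sweeping the frequency axis and recording the largest magnitude. The sketch below does this on a logarithmic grid; it only yields a lower bound on the true supremum, and the grid limits, the number of points and the lightly damped example system $1/(s^2 + 0.2s + 1)$ are assumptions made for illustration. Dedicated algorithms are preferable in practice.

```python
import math


def hinf_norm_siso(tf, w_min=1e-2, w_max=1e2, points=20000):
    """Grid estimate (a lower bound) of sup_w |tf(jw)| on [w_min, w_max].

    Returns the peak magnitude and the frequency at which it was found.
    """
    peak, w_peak = 0.0, w_min
    for k in range(points):
        w = w_min * (w_max / w_min) ** (k / (points - 1))
        mag = abs(tf(1j * w))
        if mag > peak:
            peak, w_peak = mag, w
    return peak, w_peak


def lightly_damped(s):
    """Hypothetical example: P(s) = 1 / (s^2 + 0.2 s + 1), damping ratio 0.1."""
    return 1.0 / (s * s + 0.2 * s + 1.0)


if __name__ == "__main__":
    peak, w_peak = hinf_norm_siso(lightly_damped)
    # Closed-form resonance peak of a second-order system, for comparison.
    zeta = 0.1
    exact = 1.0 / (2.0 * zeta * math.sqrt(1.0 - zeta ** 2))
    print(f"grid estimate {peak:.4f} at w = {w_peak:.4f} rad/s, closed form {exact:.4f}")
```

The estimate corresponds to the height of the resonance peak in the Bode magnitude plot, as described above.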

The controller design amounts to finding a $K$ that minimizes the closed-loop norm from $w$ to
$z$ in figure 7. For the analysis phase the designer has to put the actual system into the
form of a generalized control problem according to figure 7. Standard software packages
exist that solve numerically the synthesis problem once the problem has been put in the
generalized form. In order to get meaningful controller synthesis problems, frequency
weights on the exogenous inputs $w$ and outputs $z$ are incorporated to perform the
corresponding optimizations over specific frequency ranges.

Once the stabilizing controller $K$ is synthesized, it remains to analyse the closed-loop
performance that it provides. In this phase, the controller for the configuration in figure 7 is
incorporated into the generalized plant $G$ to form the system $\mathcal{N}$, as shown in figure 8.

Fig. 8. Relation between $w$ and $z$ in the generalized control problem.

It is relatively straightforward to show that the expression for $\mathcal{N}$ is given by
$$\mathcal{N} = G_{11} + G_{12}K(I - G_{22}K)^{-1}G_{21} = \mathcal{F}_l(G, K) \tag{36}$$

where $\mathcal{F}_l$ denotes the lower Linear Fractional Transformation (LFT) of $G$ and $K$. In order
to obtain a good design for $K$, a precise knowledge of the plant is required. The dynamics
of interest are modeled but this model may be inaccurate (this is usually the case indeed). To
deal with this problem the real plant $P$ is assumed to be unknown but belonging to a class
of models built around a nominal model $P_o$. This set of models is characterized by a matrix
$\Delta$, which can be either a full matrix (unstructured uncertainty) or a block diagonal matrix
(structured uncertainty), that includes all possible perturbations representing uncertainty to
the system. Weighting matrices $W_1$ and $W_2$ are usually employed to express the uncertainty
in terms of normalized perturbations in such a way that $\|\Delta\|_\infty < 1$. The general control
configuration in figure 7 may be extended to include model uncertainty as shown in
figure 9.
Fig. 9. Generalized control problem configuration.

The block diagram in figure 9 is used to synthesize a controller $K$. To transform it for
analysis, the lower loop around $G$ is closed by the controller $K$ and it is incorporated into
the generalized plant $G$ to form the system $\mathcal{N}$ as shown in figure 10. The same lower
LFT is obtained as if no uncertainty were considered.

Fig. 10. Generalized block diagram for analysis in the face of uncertainty.

To evaluate the relation from $w = [w_1 \; w_2]^T$ to $z = [z_1 \; z_2]^T$ for a given controller $K$ in the
uncertain system, the upper loop around $\mathcal{N}$ is closed with the perturbation matrix $\Delta$. This
results in the following upper LFT:

$$\mathcal{F}_u(\mathcal{N}, \Delta) = \mathcal{N}_{22} + \mathcal{N}_{21}\Delta(I - \mathcal{N}_{11}\Delta)^{-1}\mathcal{N}_{12} \tag{37}$$

and so $z = \mathcal{F}_u(\mathcal{N}, \Delta)w$. To represent any control problem with uncertainty by the general
control configuration it is necessary to represent each source of uncertainty by a single
perturbation block $\Delta$, normalized such that $\|\Delta\|_\infty < 1$. We will assume in this work that we
can collect all the sources of uncertainties into a single full (unstructured) matrix $\Delta$.
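
The lower and upper LFTs are simple to evaluate when every block is scalar. The sketch below implements both formulas for scalar blocks at a single complex frequency. As an illustration it builds a hypothetical generalized plant for a tracking-error problem ($z = e = w - P u$, $v = e$, so $G_{11} = 1$, $G_{12} = -P$, $G_{21} = 1$, $G_{22} = -P$); closing the lower loop with $u = K v$ then recovers the sensitivity function. The plant and controller values are assumptions chosen for the example.

```python
def lower_lft(g11, g12, g21, g22, k):
    """F_l(G, K) = G11 + G12 K (1 - G22 K)^{-1} G21 for scalar blocks."""
    return g11 + g12 * k * g21 / (1.0 - g22 * k)


def upper_lft(n11, n12, n21, n22, delta):
    """F_u(N, Delta) = N22 + N21 Delta (1 - N11 Delta)^{-1} N12 for scalar blocks."""
    return n22 + n21 * delta * n12 / (1.0 - n11 * delta)


if __name__ == "__main__":
    s = 1j                       # evaluate at w = 1 rad/s
    p = 1.0 / (s + 1.0)          # hypothetical plant P(s) = 1/(s + 1)
    k = 10.0                     # hypothetical controller gain

    # Tracking-error generalized plant: z = w - P u, v = w - P u, u = K v.
    closed = lower_lft(1.0, -p, 1.0, -p, k)
    print("F_l(G, K)   =", closed)
    print("1/(1 + P K) =", 1.0 / (1.0 + p * k))

    # With Delta = 0 the upper LFT collapses to the nominal block N22.
    print("F_u(N, 0)   =", upper_lft(0.3, 0.5, 0.7, closed, 0.0))
```

Setting `delta` to zero in `upper_lft` returns the nominal map, which matches the remark that the same lower LFT is obtained when no uncertainty is considered.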

3.2 Uncertainty and robustness 

As already commented, an exact knowledge of the plant is never possible. Therefore, it is 
often assumed that the real plant, denoted by P , is unknown but belonging to a set of class 

models characterized somehow by A and with centre P . 

Definition 4 (Nominal Stability) The closed-loop system of figure 9 has Nominal Stability
(NS) if the controller $K$ internally stabilizes the nominal model $P_o$ ($\Delta = 0$), i.e.,
the four transfer matrices $M_{11}$, $M_{12}$, $M_{21}$, $M_{22}$ in the closed-loop transfer
matrix $M$ shown in figure 13 are stable.

Definition 5 (Nominal Performance) The closed-loop system of figure 12 has Nominal
Performance (NP) if the performance objectives are satisfied for the nominal model $P_o$, i.e.,
$\|M_{22}\|_\infty < 1$ in figure 10 assuming $\Delta = 0$.

Definition 6 (Robust Stability) The closed-loop system has Robust Stability (RS) if the
controller $K$ internally stabilizes the closed-loop system in figure 9
($\mathcal{F}_u(M, \Delta)$) for every $\Delta$ such that $\|\Delta\|_\infty < 1$.

In this work we consider only additive uncertainty, which is expressed mathematically as

$$\mathcal{P}_\Delta = \{P : P = P_o + W_\Delta \Delta\} \tag{38}$$

where $W_\Delta$ is a scalar frequency weight and $\|\Delta\|_\infty < 1$. Now that we know
how to describe the set of plants in which the real plant is assumed to lie, the next issue is
to determine when a controller stabilizes all the plants belonging to this set.
Theorem 5 (Robust Stability for unstructured uncertainty) Let us assume that we have
posed our system in the form illustrated by figure 9. The overall system is robustly stable
(see definition 6) iff

$$\|M_{11}\|_\infty \le 1 \tag{39}$$

where $M_{11}$ has been defined in (36), see figure 10.

Robust stability conditions for the different uncertainty representations can be derived by
posing the corresponding feedback loops as in figure 9 and then applying theorem 5, also
known as the small gain theorem. See (Morari and Zafiriou, 1989) for details.
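A small-gain condition of this kind can be checked numerically by estimating the peak gain of the weighted transfer function on a frequency grid. The sketch below assumes the weight $3(s+1)/(s+20)$ used in the example of section 5 and a purely hypothetical closed-loop relation `t_example`; the grid estimate is only a lower bound on the true norm, so a margin should be kept in practice.

```python
def w_delta(s):
    """Uncertainty weight 3(s + 1)/(s + 20), as in the example of section 5."""
    return 3.0 * (s + 1.0) / (s + 20.0)


def t_example(s):
    """Hypothetical stable closed-loop relation, for illustration only."""
    return 0.3 / (0.1 * s + 1.0)


def peak_gain(tf, w_min=1e-3, w_max=1e4, points=2000):
    """Estimate sup_w |tf(jw)| on a logarithmic grid (a lower bound)."""
    peak = 0.0
    for k in range(points):
        w = w_min * (w_max / w_min) ** (k / (points - 1))
        peak = max(peak, abs(tf(1j * w)))
    return peak


weighted_peak = peak_gain(lambda s: w_delta(s) * t_example(s))
print(weighted_peak, weighted_peak < 1.0)
```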

4. The design for the proposed robust 2-DOF Observer-Controller 

In this section a methodology for designing 2-DOF controllers is provided. The design is
based on the Observer-Controller configuration described in section 2.2. To obtain a 2-DOF
scheme, a prefilter block ($K_2$) has been added, leading to the general scheme shown in
figure 11.
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Fig. 11. The proposed 2-DOF control configuration. 

The controller blocks $X_r$, $Y_r$, $R$, which implicitly fix the Youla parameter $Q$ of
theorem 5, are in charge of providing robust stability and good output disturbance rejection.
The prefilter $K_2$ (the Youla parameter $Q$ of theorem 5), on the other hand, has to cope
with the tracking properties of the system by solving a model matching problem with respect to
a specified reference model which describes the desired closed-loop behaviour of the
resulting controlled system.

4.1 Step I: Design of the Observer-Controller part through direct search optimization 

In section 2.2 we characterized the Observer-Controller configuration in terms of the
polynomials $p_K$, $p_L$ and $m$. Let us assume without loss of generality that additive
output uncertainty (38) is considered. In this first step of the design the objective is to
find suitable $p_K$, $p_L$, $m$, which entirely define $X_r$, $Y_r$, $R$ in figure 12. This
search is performed so as to provide robust stability with the best possible output
disturbance rejection.
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Fig. 12. Observer-Controller part with additive uncertainty. 

More specifically, for the scheme in figure 12 the following relations hold 

M r (l-RM r )Y r M r R 

W p (\-N r (\-RM r )Y) r N r R 



(40) 
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where all the terms have been defined in section 2.2. It can be proved by applying theorem 5 
to figure 12 (once put in the generalized controller configuration of figure 9) that robust 
stability of the system in figure 12 amounts to satisfy the following inequality 



$$\|T_{u'd_o}\|_\infty < 1 \tag{41}$$



The design for the Observer-Controller part finally reduces to solving the following
optimization problem

$$\min_{p_K,\, p_L,\, m} \; \|W_p (1 - N_r (1 - R M_r) Y_r)\|_\infty \quad \text{subject to} \quad \|W_\Delta M_r (1 - R M_r) Y_r\|_\infty < 1 \tag{42}$$



Direct search techniques - see (Powell, M., 1998) - are suggested for solving the problem (42). 
Basically they consist of a method for solving optimization problems that does not require 
any information about the gradient of the objective function. Unlike more traditional 
optimization methods that use information about the gradient or higher derivatives to 
search for an optimal point, a direct search algorithm searches a set of points around the 
current point, looking for one where the value of the objective function is lower than the 
value at the current point. At each step, the algorithm searches a set of points, called a mesh, 
around the current point — the point computed at the previous step of the algorithm. The 
mesh is formed by adding the current point to a scalar multiple of a set of vectors called a 
pattern. If the pattern search algorithm finds a point in the mesh that improves the objective 
function at the current point, the new point becomes the current point at the next step of the 
algorithm. In (Henrion, D., 2006) a recent application of direct search techniques for solving 
a specific control problem can be consulted. 
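The polling mechanism just described can be sketched in a few lines. This is a minimal compass-type pattern search written for illustration, not the solver used by the authors; the pattern (the positive and negative coordinate directions), the mesh-halving rule and the test function `objective` are assumptions. A constraint such as the one in (42) could be handled by adding a penalty term to `f`.

```python
def pattern_search(f, x0, step=1.0, tol=1e-6, max_iter=10000):
    """Compass (pattern) search: poll the mesh x + step*d, d in {+e_i, -e_i}."""
    x = list(x0)
    fx = f(x)
    n = len(x)
    for _ in range(max_iter):
        if step < tol:
            break
        # Build the mesh around the current point.
        mesh = []
        for i in range(n):
            for sign in (1.0, -1.0):
                candidate = list(x)
                candidate[i] += sign * step
                mesh.append(candidate)
        best = min(mesh, key=f)
        f_best = f(best)
        if f_best < fx:
            x, fx = best, f_best   # successful poll: move to the better point
        else:
            step *= 0.5            # unsuccessful poll: refine the mesh
    return x, fx


def objective(x):
    """Smooth test function (assumed), with its minimum at (1, -2)."""
    return (x[0] - 1.0) ** 2 + (x[1] + 2.0) ** 2


x_opt, f_opt = pattern_search(objective, [0.0, 0.0])
print(x_opt, f_opt)
```

No gradient is evaluated anywhere; only comparisons of objective values drive the search.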



4.2 Step II: Design of the prefilter controller $K_2$

In this section we tackle the second step of our design. This step is aimed at designing a 
prefilter controller for meeting tracking specifications given in terms of a reference model. In 
order to achieve this goal a model matching problem is posed as in figure 13 
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Fig. 13. Model-matching problem arrangement for the design of K 2 
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where T is the specified reference model for the closed-loop dynamics. The idea is to 
make use of the general control framework introduced in section 3 and design $K_2$ so as to
minimize the relation from $r$ to $e$ in an $\mathcal{H}_\infty$ sense. In doing so, we also
want some control over the amount of control effort employed to make the closed loop resemble
the reference model. The parameter $\alpha$ plays precisely this role: it enables a
compromise between the tracking quality and the amount of energy demanded by the
controller, thereby accommodating this practical consideration in the design.

Note that in the nominal case, i.e., $P = P_o$, the prefilter controller $K_2$ sees just
$N_r R$. Therefore, the relation from the reference to the output reads as

$$y = N_r R K_2 r \tag{43}$$



It should be noted that $N_r$ and $R$ have already been fixed as a result of the first
step of the design. To include different and independent dynamics for the step response, we
have to take advantage of the second degree of freedom that $K_2$ provides. From the overall
scheme in figure 11 we can compute the transfer matrix function that relates the inputs
$[d_o \; r]^T$, i.e., the disturbance $d_o$ and the command signal $r$, with the outputs
$[u' \; e']^T$, i.e., the weighted control signal $u'$ and the weighted model matching error $e'$.



-W^X-M^Xy+RNX)) cxW l M r (\-RM r )Y r K 2 



N r (X + RNY r ) 



(N r RK 2 -T ref ) 



(44) 



The $\mathcal{H}_\infty$-norm of the complete transfer matrix function (44) is minimized to
find $K_2$ without modifying either the robust stability margins or the disturbance rejection
properties provided by the Observer-Controller in the first step of the design. The 2-DOF
design problem shown in figure 13 can be easily cast into the general control configuration
seen in section 3. Comparing figures 12 and 13 with figure 9 we make the following pairings:
$w_1 = d_o$, $w_2 = r$, $z_1 = u'$, $z_2 = e'$, $v = \beta$, $u = u$ and $K = K_2$. The
augmented plant $G$ and the controller $K_2$ are related by the following lower LFT:

$$\mathcal{F}_l(G, K_2) = G_{11} + G_{12} K_2 (I - G_{22} K_2)^{-1} G_{21}$$

The corresponding partitioned generalized plant $G$ is:



_y_ ' _il 

G 21 | G 22. 



- Wl (\-M r (X r +RN r Y r j) 

N r (X r+ RN r Y r ) 




j W x M r R 
-a\ ef j aN r R 



(45) 



(46) 
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Remark. The reference signal r must be scaled by a constant W to make the closed-loop 

transfer function from $r$ to the controlled output $y$ match the desired reference model
$T_{ref}$ exactly at steady state. This is not guaranteed by the optimization, which is aimed
at minimizing the $\mathcal{H}_\infty$-norm of the error. The required scaling is given by

$$W_r = (K_2(0) N_r(0) R(0))^{-1} T_{ref}(0) \tag{47}$$

Therefore, the resulting reference controller is $K_2 W_r$.

5. Illustrative example 

In this section we apply the methodology presented in section 4 to design a 2-DOF
controller according to figure 11 for the following nominal plant

$$P_o = \frac{5(s + 1.3)}{(s + 1)(s + 2)} \tag{48}$$

The uncertainty in the model is parameterised using an additive uncertainty description as
in (38) with

$$W_\Delta = \frac{3(s + 1)}{(s + 20)} \tag{49}$$

Now we initialize the optimization problem (42) with the values
$p_{K0} = [-10 \; -10]^T$, $p_{L0} = [-20 \; -20]^T$, $m_0 = [1 \; 2]^T$, where $p_{K0}$
($p_{L0}$) contains the initial roots of the polynomial $p_K$ ($p_L$) and $m_0$ the initial
coefficients of the polynomial $m$.

For this example we have used an unconstrained direct search optimization solver, and the
objective function is the one that appears in problem (42) plus a penalty that acts when the
robust stability constraint is not satisfied. In order to have nearly perfect output
disturbance rejection in steady state, the following weight for the $T_{yd_o}$ relation has
been used

W r = —*— (50) 

5 + 0.01 

this forces the direct search algorithm to provide small values for the $T_{yd_o}$ magnitude
response at dc. The optimization procedure finally results in the following optimal controller
blocks for the feedback part of the design

$$X_r = \frac{s^2 + 57.19s - 112500}{s^2 + 35.67s + 245}, \quad Y_r = \frac{22710s + 40190}{s^2 + 35.67s + 245}, \quad R = \frac{0.05355s + 0.323}{s^2 + 24.53s + 148.2} \tag{51}$$

or in terms of $p_K$, $p_L$, $m$:

$$p_K = [-10.7774 \; -13.7519]^T, \quad p_L = [-9.2857 \; -26.3793]^T, \quad m = [0.0535 \; 0.3230]^T \tag{52}$$




Fig. 14. Top: magnitude response of $T_{yd_o}$. Bottom: magnitude response of
$W_\Delta T_{ud_o}$ satisfying the robust stability condition.

Figure 14 shows the resulting $T_{yd_o}$ (output disturbance rejection) and
$W_\Delta T_{ud_o}$ magnitude responses. The former provides especially good disturbance
rejection at steady state, and the $\mathcal{H}_\infty$-norm of the latter is less than one,
which ensures the robust stability specification.

So far we have obtained the final design for the Observer-Controller part, ensuring robust
stability and the best achievable output disturbance rejection in an $\mathcal{H}_\infty$
sense. We turn now to the design of the prefilter controller $K_2$.

Let us assume that we are given the desired closed-loop dynamics in terms of the following
reference model

$$T_{ref} = \frac{49}{(s + 7)^2} \tag{53}$$

The reference model (53) provides second order responses with time constant 1/7 seconds.
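To make the target behaviour concrete, the unit-step response of the reference model can be written in closed form and cross-checked by a simple simulation. The sketch assumes that the reference model is $49/(s+7)^2$ (a double pole at $s = -7$, which is what the second-order description suggests), so that $y(t) = 1 - (1 + 7t)e^{-7t}$; the function names and the Euler step size are illustrative.

```python
import math


def t_ref_step(t):
    """Unit-step response of 49/(s + 7)^2: y(t) = 1 - (1 + 7 t) exp(-7 t)."""
    return 1.0 - (1.0 + 7.0 * t) * math.exp(-7.0 * t)


def simulate_step(t_end, dt=1e-4):
    """Forward-Euler check on y'' + 14 y' + 49 y = 49 u with u = 1."""
    y, dy = 0.0, 0.0
    for _ in range(int(round(t_end / dt))):
        ddy = 49.0 * (1.0 - y) - 14.0 * dy
        y, dy = y + dt * dy, dy + dt * ddy
    return y


for t in (0.5, 1.0):
    print(t, t_ref_step(t), simulate_step(t))
```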
The second step of the design explained in section 4 results in the prefilter block

$$K_2 = \frac{-0.006051s^3 + 11.65s^2 + 106.8s + 181.7}{s^3 + 28.65s^2 + 184.8s + 508.3} \tag{54}$$
This prefilter block has been obtained using $\alpha = 6$, see figure 17.

Figure 15 shows that the value $\alpha = 6$ provides tight model matching with the
minimum possible control action. Larger values of $\alpha$ do not significantly improve
the model matching and cause the control action to take higher values.

Fig. 15. From bottom to top (in solid), magnitude responses of T (top figure) and
T (bottom figure) for $\alpha = 1, 3, 6$. The dashed line shows the response of the target
model $T_{ref}$.

Fig. 16. Bode diagram for the original 11th order prefilter $K_2$ (dashed) and the 3rd order
prefilter $K_2$ (solid) finally obtained by applying order reduction techniques.

$\mathcal{H}_\infty$ optimization techniques typically result in very high order
controllers. In this case, the resulting $K_2$ is of order eleven. However, standard order
reduction techniques can be applied to reduce this order. For this example, a
model reduction based on a balanced realization and the Hankel singular values - see
(Skogestad S., 1997) - has been performed, finally yielding a third order $K_2$ without
sacrificing any significant performance, see figure 16.

To summarize the design, figure 17 shows the final closed-loop response to
a step set-point change applied at t=0 seconds and a step output disturbance
applied at t=3 seconds.

Fig. 17. Time response of the reference model $T_{ref}$ (dotted), nominal controlled system
(solid) and uncertain ($\Delta = 0.25$ in (38)) controlled system (dashed). The response
of the nominal controlled system without the prefilter controller is also shown (x-marked).



6. Conclusion 

A new 2-DOF control configuration based on a right coprime factorization of the model of
the plant has been presented. The approach has been introduced as an alternative to the
commonly encountered strategy of setting the two controllers arbitrarily, with internal
stability as the only restriction, and parameterizing the controller in terms of the Youla
parameters.

A non-minimal-observer-based state feedback control scheme has been designed first to
guarantee some levels of robust stability and output disturbance rejection by solving a
constrained $\mathcal{H}_\infty$ optimization problem for the poles of the right coprime
factors $X_r$, $Y_r$, $N_r$, $M_r$ and the polynomial $m$. After that, a prefilter controller
that adapts the reference command and improves the tracking properties has been designed
using the generalized control framework introduced in section 3.
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1. Introduction 

In this contribution, three nonlinear control strategies are presented for a two-degree-of- 
freedom parallel robot that is actuated by two pairs of pneumatic muscle actuators as 
depicted in Fig. 1. Pneumatic muscles are innovative tensile actuators consisting of a fibre- 
reinforced vulcanised rubber hose with appropriate connectors at both ends. The working 
principle is based on a rhombical fibre structure that leads to a muscle contraction in 
longitudinal direction when the pneumatic muscle is filled with compressed air. Pneumatic 
muscles are low cost actuators and offer several further advantages in comparison to 
classical pneumatic cylinders: significantly less weight, no stick-slip effects, insensitivity to 
dirty working environment, and a higher force-to-weight ratio. The achievable closed-loop 
performance using such actuators has already been investigated experimentally at a linear 
axis with a pair of antagonistically arranged pneumatic muscles (Aschemann & Hofer, 
2004). Current research activities concentrate on the use of pneumatic muscles as actuators 
for parallel robots, which are known for providing high stiffness, and especially for the 
capability of performing fast and highly accurate motions of the end-effector. The planar 
parallel robot under consideration is characterised by a closed-chain kinematic structure 
formed by four moving links and the robot base, which offers two degrees of freedom, see 
Fig. 1. All joints are revolute joints, two of which - the cranks - are actuated by a pair of 
pneumatic muscles, respectively. The coordinated contractions of a pair of pneumatic
muscles are transformed into a rotation of the corresponding crank by means of a toothed belt
and a pulley. The mass flow rate of compressed air is provided by a separate proportional
valve for each pneumatic muscle. 

The paper is structured as follows: first, a mathematical model of the mechatronic system is 
derived, which results in a symbolic nonlinear state space description. Second, a cascaded 
control structure is proposed: the control design for the inner control loops involves a 
decentralised pressure control for each pneumatic muscle with high bandwidth, whereas 
the design of the outer control loop deals with decoupling control of the two crank angles 
and the two mean pressures of both pairs of pneumatic muscles. For the inner control loops 
nonlinear pressure controls are designed taking advantage of differential flatness. For the 
outer control loop three alternative approaches have been investigated: flatness-based 
control, backstepping, and sliding-mode control. Third, to account for nonlinear friction as
well as model uncertainties, a nonlinear reduced order disturbance observer is used in a
disturbance compensation scheme. Simulation results of the closed-loop system show 
excellent tracking performance and high steady-state accuracy. 




Fig. 1. Test rig. 

2. System modelling 

The modelling of the pneumatically driven parallel robot involves the mechanical 
subsystem and the pneumatic subsystem, which are coupled by the torques resulting from 
the tension forces of a pair of pneumatic muscles, respectively. 



2.1 Multibody model of the parallel robot 

The control-oriented multibody model of the parallel robot part consists of three rigid
bodies (Fig. 2): the two cranks as actuated links with identical properties (mass $m_A$,
reduced mass moment of inertia $J_A$ w.r.t. the actuated axis, distance $s_A$ to the centre
of gravity, length of the link $l_A$, pulley radius $r$) and the end-effector E (mass $m_E$),
which is modelled as a lumped mass.




Fig. 2. Multibody model of the parallel robot. 
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The inertia properties of the remaining two links with length lp, which are designed as light- 
weight construction, shall be neglected in comparison to the other links. The inertial xz- 
coordinate system is chosen in the middle of the straight line that connects both base joints. 
The motion of the parallel robot is completely described by two generalised coordinates
$q_1(t)$ and $q_2(t)$ that denote the two crank angles, which are combined in the vector
$q = [q_1, q_2]^T$. Analogously, the vector of the end-effector coordinates is defined as
$r = [x_E, z_E]^T$.




Fig. 3. Ambiguity of the robot kinematics. 

The direct kinematics can be stated in symbolic form and describes the vector of end-effector
coordinates $r$ in terms of given crank angles $q$, i.e.

$$r = r(q, k_3). \tag{1}$$

Here, the configuration parameter $k_3$ is introduced to cope with two possible configurations,
see Fig. 3. The relationship between the corresponding velocities is obtained by
differentiation with respect to time

$$\dot{r} = J(q, k_3)\,\dot{q}, \qquad J(q, k_3) = \frac{\partial r(q, k_3)}{\partial q^T}, \tag{2}$$

where $J(q, k_3)$ denotes the corresponding Jacobian. Here, singularities in the Jacobian can be
avoided by model-based trajectory planning. Analogously, the acceleration relationship is
given by

$$\ddot{r} = J(q, k_3)\,\ddot{q} + \dot{J}(q, k_3)\,\dot{q}. \tag{3}$$

For a given end-effector position $r$ the corresponding crank angles follow from the inverse
kinematics

$$q = q(r, k_1, k_2), \tag{4}$$

which can be determined in symbolic form. The given ambiguity is taken into account by
introducing two configuration parameters $k_1$ and $k_2$ as shown in Fig. 3. The relationships
between the corresponding velocities as well as the accelerations follow from the direct
kinematics

$$\dot{q} = J^{-1}(q, k_3)\,\dot{r}, \qquad \ddot{q} = J^{-1}(q, k_3)\left[\ddot{r} - \dot{J}(q, k_3)\,\dot{q}\right]. \tag{5}$$
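The role of the configuration parameter and of the Jacobian can be illustrated with a generic planar five-bar linkage. The sketch below is not the symbolic kinematics of the test rig: the geometry (base joints at $(\mp b, 0)$, crank angles measured from the x-axis, the numerical link lengths) and the function names are assumptions chosen for illustration, and the Jacobian is approximated by central differences instead of being derived symbolically.

```python
import math


def direct_kinematics(q1, q2, k3=1, b=0.2, l_a=0.3, l_p=0.5):
    """End-effector position (x_E, z_E) of a planar five-bar linkage.

    Assumed geometry (not taken from the text): base joints at (-b, 0) and
    (+b, 0), cranks of length l_a, coupler links of length l_p.
    k3 = +1 or -1 selects one of the two possible configurations.
    """
    # Crank end points.
    x1, z1 = -b + l_a * math.cos(q1), l_a * math.sin(q1)
    x2, z2 = b + l_a * math.cos(q2), l_a * math.sin(q2)
    dx, dz = x2 - x1, z2 - z1
    d = math.hypot(dx, dz)
    if d == 0.0 or d > 2.0 * l_p:
        raise ValueError("configuration not reachable")
    # The end-effector lies on both circles of radius l_p around the crank ends.
    h = math.sqrt(l_p ** 2 - (d / 2.0) ** 2)
    xm, zm = (x1 + x2) / 2.0, (z1 + z2) / 2.0
    return xm - k3 * h * dz / d, zm + k3 * h * dx / d


def jacobian(q1, q2, k3=1, eps=1e-6):
    """Central-difference approximation of J = dr/dq^T (2 x 2)."""
    cols = []
    for dq1, dq2 in ((eps, 0.0), (0.0, eps)):
        xp, zp = direct_kinematics(q1 + dq1, q2 + dq2, k3)
        xm, zm = direct_kinematics(q1 - dq1, q2 - dq2, k3)
        cols.append(((xp - xm) / (2 * eps), (zp - zm) / (2 * eps)))
    return [[cols[0][0], cols[1][0]], [cols[0][1], cols[1][1]]]


q1, q2 = 2.0 * math.pi / 3.0, math.pi / 3.0   # symmetric pose
print(direct_kinematics(q1, q2, +1), direct_kinematics(q1, q2, -1))
print(jacobian(q1, q2))
```

The two calls with `k3 = +1` and `k3 = -1` return the two mirror-image end-effector positions that are compatible with the same pair of crank angles.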




Fig. 4. Free-body diagram of the parallel robot. 

The equations of motion for the actuated links can be directly derived from the free-body
diagram in Fig. 4 by applying the principle of angular momentum

$$\begin{aligned} J_A\,\ddot{q}_1 &= r\,[F_{M1l} - F_{M1r}] - m_A\,g\,s_A \cos q_1 + F_{1E}\,l_A \sin\beta_1 + \eta_1, \\ J_A\,\ddot{q}_2 &= r\,[F_{M2l} - F_{M2r}] - m_A\,g\,s_A \cos q_2 - F_{2E}\,l_A \sin\beta_2 + \eta_2. \end{aligned} \tag{6}$$

Here, the driving torque $\tau_i$ of drive $i$ depends on the corresponding muscle forces, i.e.
$\tau_i = r\,[F_{Mil} - F_{Mir}]$. The indices of all variables describing a particular
pneumatic muscle are chosen as follows: the first index $i = \{1, 2\}$ denotes the drive under
consideration, described by the generalised coordinate $q_i(t)$, whereas the second index
$j = \{l, r\}$ stands for the mounting position, i.e. for the left or the right pneumatic
muscle. The disturbance torque $\eta_i$ accounts for friction effects as well as remaining
uncertainties in the muscle force characteristics (13) of drive $i$. The coupling forces
$F_{1E}$ and $F_{2E}$ are obtained from Newton's second law applied to the end-effector



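$$\begin{bmatrix} m_E\,\ddot{x}_E \\ m_E\,(g + \ddot{z}_E) \end{bmatrix} = \begin{bmatrix} \cos\gamma_1 & -\cos\gamma_2 \\ \sin\gamma_1 & \sin\gamma_2 \end{bmatrix} \begin{bmatrix} F_{1E} \\ F_{2E} \end{bmatrix}. \tag{7}$$

The equations of motion in minimal form for the crank angles can be derived in two steps.
First, the last equation has to be solved for the unknown forces

$$\begin{bmatrix} F_{1E} \\ F_{2E} \end{bmatrix} = \begin{bmatrix} \cos\gamma_1 & -\cos\gamma_2 \\ \sin\gamma_1 & \sin\gamma_2 \end{bmatrix}^{-1} \begin{bmatrix} m_E\,\ddot{x}_E \\ m_E\,(g + \ddot{z}_E) \end{bmatrix}, \tag{8}$$

which then can be eliminated in (6). Second, the substitution of the variables
$\gamma_i = \gamma_i(q)$, $\beta_i = \beta_i(q)$, and (3) resulting from direct kinematics
leads to the envisaged minimal form of the equations of motion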



$$M(q)\,\ddot{q} + k(q, \dot{q}) + G(q) = Q(q), \tag{9}$$

with the mass matrix $M(q)$, the vector of centrifugal and Coriolis terms $k(q, \dot{q})$ and
the vector of gravity torques $G(q)$. The vector of generalised torques $Q(q)$ contains the
corresponding muscle forces times the radius $r$ of the pulley

$$Q(q) = r \begin{bmatrix} F_{M1l} - F_{M1r} \\ F_{M2l} - F_{M2r} \end{bmatrix}. \tag{10}$$

Note that this minimal form of the equations of motion is not compulsory. Instead, the
corresponding system of differential-algebraic equations can be utilised as well for the
flatness-based control design.

2.2 Modelling of the pneumatic subsystem 

The parallel robot is equipped with four pneumatic muscle actuators. The contraction
lengths of the pneumatic muscles are related to the generalised coordinates, i.e. the crank
angles $q_i$. The position of the crank angle at which the corresponding right pneumatic
muscle is fully contracted is denoted by $q_{i0}$. Consequently, by considering the
transmission consisting of toothed belt and pulley, the following constraints hold for the
contraction lengths of the muscles

$$\Delta\ell_{Mil}(q_i) = r\,(q_i - q_{i0}), \qquad \Delta\ell_{Mir}(q_i) = \Delta\ell_{M,max} - r\,(q_i - q_{i0}). \tag{11}$$

Here, $\Delta\ell_{M,max}$ is the maximum contraction given by 25% of the uncontracted length.

The volume characteristic of the pneumatic muscle (Fig. 5) can be approximated with high
accuracy by the following nonlinear function of both contraction length and muscle
pressure, where the coefficients of this polynomial approximation have been identified by
measurements

$$V_{Mij}(\Delta\ell_{Mij}, p_{Mij}) = \sum_{m=0} (a_m\,\Delta\ell_{Mij}^m) \cdot \sum_{n=0} (b_n\,p_{Mij}^n). \tag{12}$$

The force characteristic $F_{Mij}(p_{Mij}, \Delta\ell_{Mij})$ of the pneumatic muscle shown in
Fig. 6 describes the resulting static tension force for given internal pressure $p_{Mij}$ as
well as given contraction length $\Delta\ell_{Mij}$. This nonlinear force characteristic has
been identified by static measurements and then approximated by the following polynomial
description

$$F_{Mij} = \bar{F}_{Mij}(\Delta\ell_{Mij})\,p_{Mij} - f_{Mij}(\Delta\ell_{Mij}) = \sum_{m=0} (c_m\,\Delta\ell_{Mij}^m)\,p_{Mij} - \sum_{n=0} (d_n\,\Delta\ell_{Mij}^n). \tag{13}$$
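Both characteristics are products and differences of polynomials, so they are cheap to evaluate online. The following sketch shows one way to do so with Horner's scheme; the coefficient values are placeholders chosen for illustration and are not the identified coefficients, and the function names are hypothetical.

```python
def polyval(coeffs, x):
    """Evaluate sum_k coeffs[k] * x**k with Horner's scheme."""
    result = 0.0
    for c in reversed(coeffs):
        result = result * x + c
    return result


def muscle_force(p, dl, c, d):
    """Force characteristic (13): F = (sum c_m dl^m) * p - sum d_n dl^n."""
    return polyval(c, dl) * p - polyval(d, dl)


def muscle_volume(p, dl, a, b):
    """Volume characteristic (12): V = (sum a_m dl^m) * (sum b_n p^n)."""
    return polyval(a, dl) * polyval(b, p)


# Placeholder coefficients for illustration only (not identified values).
c = [1.0e-3, -5.0e-3]
d = [10.0, 200.0]
print(muscle_force(p=4.0e5, dl=0.02, c=c, d=d))
```

Because the force is affine in the pressure for a fixed contraction length, it can later be inverted for the pressure in closed form.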
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Fig. 5. Volume characteristic of a pneumatic muscle. 
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Fig. 6. Force characteristic of a pneumatic muscle. 

The dynamics of the internal muscle pressure follows directly from a mass flow balance in
combination with the pressure-density relationship. As the internal muscle pressure is
limited by a maximum value of $p_{max} = 7$ bar, the ideal gas equation can be utilised
as an accurate description of the thermodynamic behaviour of the compressed air

$$p_{Mij} = R\,T_{Mij}\,\rho_{Mij}. \tag{14}$$

Here, the density $\rho_{Mij}$, the gas constant $R$ of air, and the thermodynamic temperature
$T_{Mij}$ are introduced. For the thermodynamic process a polytropic change of state is
assumed. Thus, the relationship between the time derivative of the pressure and the time
derivative of the density results in

$$\dot{p}_{Mij} = n\,R\,T_{Mij}\,\dot{\rho}_{Mij}. \tag{15}$$
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The mass flow balance for the pneumatic muscle is governed by 

$$\dot{\rho}_{Mij} = \frac{1}{V_{Mij}(\Delta\ell_{Mij}, p_{Mij})}\left[\dot{m}_{Mij} - \rho_{Mij}\,\dot{V}_{Mij}(\Delta\ell_{Mij}, p_{Mij})\right]. \tag{16}$$

The resulting pressure dynamics is given by a nonlinear first order differential equation and
shall not be neglected as in (Carbonell et al., 2001)

$$\dot{p}_{Mij} = \frac{n}{V_{Mij} + n\,\dfrac{\partial V_{Mij}}{\partial p_{Mij}}\,p_{Mij}}\left[R\,T_{Mij}\,\dot{m}_{Mij} - \frac{\partial V_{Mij}}{\partial \Delta\ell_{Mij}}\,\frac{\partial \Delta\ell_{Mij}}{\partial q_i}\,p_{Mij}\,\dot{q}_i\right]. \tag{17}$$

The internal temperature $T_{Mij}$ can be approximated with good accuracy by the constant
temperature $T_0$ of the ambiance (Gottert, 2004). Thereby, temperature measurements can be
avoided, and the implementational effort is significantly reduced.
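The pressure dynamics can be coded as a single right-hand-side function, which is convenient for simulation. The sketch below assumes that the volume and its partial derivatives are supplied by the caller at the current operating point; the default values for the polytropic exponent, the gas constant and the ambient temperature are assumptions for illustration, as is the function name.

```python
def pressure_rate(p, m_dot, q_dot, volume, dV_dp, dV_dl, dl_dq,
                  n=1.26, R=287.0, T=293.15):
    """Right-hand side of (17) for one pneumatic muscle.

    volume, dV_dp and dV_dl are the muscle volume and its partial derivatives
    at the current operating point; dl_dq is the derivative of the contraction
    length with respect to the crank angle. n, R (J/(kg K)) and T (K) are
    assumed values.
    """
    denominator = volume + n * dV_dp * p
    return n / denominator * (R * T * m_dot - dV_dl * dl_dq * p * q_dot)


# Constant volume: the pressure rises in proportion to the mass flow.
print(pressure_rate(p=1.0e5, m_dot=1.0e-3, q_dot=0.0,
                    volume=1.0e-4, dV_dp=0.0, dV_dl=0.0, dl_dq=0.0))
```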

3. Control design based on differential flatness 

A nonlinear system in state space notation is denoted as differentially flat (Fliess et al.,
1995), if flat outputs

$$y = y(x, u, \dot{u}, \ldots, u^{(\alpha)}), \qquad \dim(y) = \dim(u) \tag{18}$$

exist that allow for expressing all system states $x$ and all system inputs $u$ in the form

$$x = x(y, \dot{y}, \ldots, y^{(\beta)}), \qquad u = u(y, \dot{y}, \ldots, y^{(\beta+1)}). \tag{19}$$

As a result, offline trajectory planning considering state and input constraints becomes
possible. Moreover, the stated parametrization of the complete system dynamics by the flat
outputs can be exploited for pure feedforward control as well as combined feedforward and
feedback control.

3.1 Flatness-based pressure control 

The nonlinear state equation (17) for the internal muscle pressure $p_{Mij}$ represents the
basis for the decentralized pressure control. It can be re-formulated as

$$\dot{p}_{Mij} = -k_{pij}(\Delta\ell_{Mij}, \Delta\dot{\ell}_{Mij}, p_{Mij})\,p_{Mij} + k_{uij}(\Delta\ell_{Mij}, p_{Mij})\,\dot{m}_{Mij}. \tag{20}$$

With the internal muscle pressure as flat output candidate $y_{ijp} = p_{Mij}$, (20) can be
solved for the mass flow $\dot{m}_{Mij}$ as control input $u_{ijp}$, which leads to the inverse
model for the pressure control

$$u_{ijp} = \frac{1}{k_{uij}(\Delta\ell_{Mij}, p_{Mij})}\left[v_{ij} + k_{pij}(\Delta\ell_{Mij}, \Delta\dot{\ell}_{Mij}, p_{Mij})\,p_{Mij}\right]. \tag{21}$$

Since the internal pressure $p_{Mij}$ as state variable is identical to the flat output and
$\dim(y_{ijp}) = \dim(u_{ijp}) = 1$ holds, the differential flatness property is proven. The
contraction length $\Delta\ell_{Mij}$ as well as its time derivative can be considered as
scheduling parameters in a gain-scheduled adaptation of $k_{uij}$ and $k_{pij}$. With the
internal pressure as flat output, its first time derivative is introduced as new control input

$$\dot{p}_{Mij} = v_{ij}. \tag{22}$$

Consequently, the state variable of the corresponding Brunovsky form has to be provided
by means of measurements, i.e. $z_{ijp} = p_{Mij}$. Each pneumatic muscle is equipped with a
pressure transducer mounted at the connection flange that connects the muscle with the
toothed belt. For the contraction length and its time derivative either measured or desired
values can be employed: in the given implementation, the scheduling parameter
$\Delta\ell_{Mij}$ results from the measured crank angle $q_i$, which is obtained by a
high-resolution encoder. Furthermore, the second scheduling parameter, the contraction
velocity, is derived from the crank angle $q_i$ by means of real differentiation using a DT1
system with the corresponding transfer function

$$G_{DT1}(s) = \frac{s}{T_1\,s + 1}. \tag{23}$$
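In a sampled implementation the DT1 element has to be discretised. One simple option, shown here as a sketch and not as the authors' implementation, is a backward-Euler approximation of $G(s) = s/(T_1 s + 1)$; the class name, the time constant and the sampling time are assumptions.

```python
class DT1Filter:
    """Backward-Euler discretisation of G(s) = s / (T1*s + 1)."""

    def __init__(self, t1, ts):
        self.t1, self.ts = t1, ts
        self.u_prev = 0.0
        self.y = 0.0

    def update(self, u):
        # T1*(y_k - y_{k-1})/Ts + y_k = (u_k - u_{k-1})/Ts
        self.y = (self.t1 * self.y + (u - self.u_prev)) / (self.t1 + self.ts)
        self.u_prev = u
        return self.y


filt = DT1Filter(t1=0.01, ts=0.001)
for k in range(300):
    rate = filt.update(2.0 * k * 0.001)   # ramp input with slope 2
print(rate)
```

For a ramp input the filter output settles at the slope of the ramp after a few time constants, which is the behaviour expected from a filtered differentiator.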

The error dynamics of each muscle pressure $p_{Mij}$ can be asymptotically stabilised by the
following control law, which is evaluated with the measured pressure. Using this control law
all nonlinearities are compensated for

$$v_{ij} = \dot{p}_{Mijd} + \alpha_{10}\,(p_{Mijd} - p_{Mij}) \quad\Rightarrow\quad \dot{e}_{pij} + \alpha_{10}\,e_{pij} = 0, \tag{24}$$

where the constant $\alpha_{10}$ is determined by pole placement. Here, the desired value for
the time derivative of the internal muscle pressure can be obtained either by real
differentiation of the corresponding control input $u_{ij}$ in (33) or by model-based
calculation using only desired values, i.e.

$$\dot{p}_{Mijd} = \dot{p}_{Mijd}(r, \dot{r}, \ddot{r}, \dddot{r}, p_{Mid}, \dot{p}_{Mid}). \tag{25}$$

The corresponding desired trajectories are obtained from a trajectory planning module that
provides synchronous time optimal trajectories according to given kinematic and dynamic
constraints (Aschemann & Hofer, 2005). A continuous time derivative $\dot{p}_{Mijd}$
evidently requires a three times continuously differentiable desired end-effector
trajectory $r$.
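The interplay of the plant model (20), the inverse model (21) and the stabilising law (24) can be checked with a few lines of simulation. The sketch below freezes the scheduled gains at assumed constant values `k_p` and `k_u` and uses a constant desired pressure, so that the tracking error should decay as a first-order lag with the pole chosen through `alpha`; all numerical values are illustrative.

```python
def simulate_pressure_loop(p0, p_des, alpha=10.0, k_p=2.0, k_u=5.0e7,
                           dt=1e-3, steps=1000):
    """Closed loop of (20), (21) and (24) for a constant desired pressure.

    k_p and k_u are frozen, assumed values; in the text they are scheduled
    with the contraction length and its time derivative.
    """
    p = p0
    for _ in range(steps):
        v = 0.0 + alpha * (p_des - p)      # (24) with zero desired pressure rate
        m_dot = (v + k_p * p) / k_u        # inverse model (21)
        p_dot = -k_p * p + k_u * m_dot     # plant model (20)
        p += dt * p_dot                    # forward-Euler step
    return p


p_end = simulate_pressure_loop(p0=1.0e5, p_des=3.0e5)
print(p_end)
```

Since the inverse model cancels the plant nonlinearity exactly in this idealised setting, the simulated error is governed by the linear error dynamics alone.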

The implementation of the underlying flatness-based pressure control structure for drive i is 
depicted in Fig. 7. In each input channel, the nonlinear valve characteristic (VC) is 
compensated by pre-multiplying with its approximated inverse valve characteristic (IVC). 
This inverse valve characteristic is implemented as look-up-table and depends both on the 
commanded mass flow and on the measured internal pressure. 



Nonlinear Model-Based Control of a Parallel Robot Driven by Pneumatic Muscle Actuators 




Fig. 7. Implementation of the underlying pressure control structure for drive i. 

3.2 Inverse dynamics of the decoupling control 

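For the outer control loop design the generalised coordinates and the mean muscle
pressures are chosen as flat output candidates

$$y = y(x, u) = \begin{bmatrix} q_1 \\ q_2 \\ p_{M1} \\ p_{M2} \end{bmatrix} = \begin{bmatrix} q_1 \\ q_2 \\ \frac{1}{2}\,(p_{M1l} + p_{M1r}) \\ \frac{1}{2}\,(p_{M2l} + p_{M2r}) \end{bmatrix}, \tag{26}$$

where the input vector $u$ contains the four muscle pressures

$$u = [p_{M1l} \;\; p_{M1r} \;\; p_{M2l} \;\; p_{M2r}]^T \tag{27}$$

and the state vector $x$ consists of the vector of generalised coordinates $q$ as well as their
time derivatives $\dot{q}$

$$x = [q^T \;\; \dot{q}^T]^T. \tag{28}$$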



The trajectory control of the mean pressure allows for increasing stiffness concerning
disturbance forces acting on the end-effector (Bindel et al., 1999). As the decentralised
pressure controls have been assigned a high bandwidth, these four controlled muscle
pressures $p_{Mij}$ can be considered as ideal control inputs of the outer control loop.
Successive differentiation of the first two flat output candidates until one of the control
inputs appears leads to

$$\dot{y}_1 = \dot{q}_1, \qquad \ddot{y}_1 = \ddot{q}_1(q, \dot{q}, p_{M1l}, p_{M1r}), \tag{29}$$

and

$$\dot{y}_2 = \dot{q}_2, \qquad \ddot{y}_2 = \ddot{q}_2(q, \dot{q}, p_{M2l}, p_{M2r}), \tag{30}$$

whereas the third and fourth flat output candidates directly depend on the control inputs

$$y_3 = p_{M1} = 0.5\,(p_{M1l} + p_{M1r}), \qquad y_4 = p_{M2} = 0.5\,(p_{M2l} + p_{M2r}). \tag{31}$$



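The differential flatness can be proven as follows: all system states can be directly expressed
by the flat outputs and their time derivatives

$$x = [q_1 \;\; q_2 \;\; \dot{q}_1 \;\; \dot{q}_2]^T = [y_1 \;\; y_2 \;\; \dot{y}_1 \;\; \dot{y}_2]^T. \tag{32}$$

The equations of motion (9) are available in symbolic form. Inserting the muscle force
characteristics, the internal muscle pressures as control inputs can be parameterized by the
flat outputs and their time derivatives

$$u = \begin{bmatrix} p_{M1l}(q, \dot{q}, \ddot{q}, p_{M1}) \\ p_{M1r}(q, \dot{q}, \ddot{q}, p_{M1}) \\ p_{M2l}(q, \dot{q}, \ddot{q}, p_{M2}) \\ p_{M2r}(q, \dot{q}, \ddot{q}, p_{M2}) \end{bmatrix} = u(q, \dot{q}, \ddot{q}, p_{M1}, p_{M2}). \tag{33}$$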
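For one drive, this parameterisation amounts to solving two linear equations. Once the required driving torque $\tau_i$ is known from the equations of motion, the relation $\tau_i = r\,[F_{Mil} - F_{Mir}]$, the force characteristic (13) and the mean pressure definition (31) can be solved for the two muscle pressures. The sketch below illustrates this step; the function name and the numerical values are assumptions, and the polynomial values `fbar_*` and `f_*` are taken as given at the current contraction lengths.

```python
def muscle_pressures(tau, p_mean, r, fbar_l, f_l, fbar_r, f_r):
    """Solve tau = r*(F_l - F_r) and p_mean = (p_l + p_r)/2 for p_l, p_r.

    Uses F_j = fbar_j * p_j - f_j from (13); fbar_j and f_j are the values of
    the two polynomials at the current contraction length of muscle j.
    """
    p_l = (tau / r + f_l - f_r + 2.0 * p_mean * fbar_r) / (fbar_l + fbar_r)
    p_r = 2.0 * p_mean - p_l
    return p_l, p_r


# Illustrative (assumed) operating point.
p_l, p_r = muscle_pressures(tau=3.0, p_mean=3.0e5, r=0.03,
                            fbar_l=1.0e-3, f_l=50.0, fbar_r=1.2e-3, f_r=80.0)
print(p_l, p_r)
```

The mean pressure acts as a free parameter: it does not change the resulting torque but shifts both pressures together, which is what allows the stiffness to be adjusted independently of the motion.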



In the following, three different nonlinear control approaches are employed to stabilize the
error dynamics of the outer control loop: flatness-based control, backstepping and
sliding-mode control (Khalil, 1996). For all these alternative designs, the differential
flatness property proves advantageous (Sira-Ramirez & Llanes-Santiago, 1995; Aschemann et al.,
2007).



3.3 Flatness-based control 

In the case of flatness-based control, the inverse dynamics is evaluated with the measured
crank angles and the corresponding angular velocities obtained by real differentiation
(Aschemann & Hofer, 2005). For the mean pressures, however, desired values are utilized.
The second derivatives of the crank angles, the angular accelerations, serve as stabilizing
inputs

$$u = [p_{M1l} \;\; p_{M1r} \;\; p_{M2l} \;\; p_{M2r}]^T = u(q, \dot{q}, v_1, v_2, p_{M1d}, p_{M2d}). \tag{34}$$

The inverse dynamics leads to a compensation of all nonlinearities. An asymptotic
stabilization is achieved by pole placement with Hurwitz polynomials for the error
dynamics of each drive $i = \{1, 2\}$

$$v_i = \ddot{q}_{id} + \alpha_{i2}\,(\dot{q}_{id} - \dot{q}_i) + \alpha_{i1}\,(q_{id} - q_i) + \alpha_{i0} \int_0^t (q_{id} - q_i)\,d\tau. \tag{35}$$
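With $\ddot{q}_i = v_i$, this stabilising law gives a tracking error whose characteristic polynomial, after one differentiation, is $s^3 + \alpha_{i2} s^2 + \alpha_{i1} s + \alpha_{i0}$. The sketch below computes the three gains from desired real pole locations and checks the Hurwitz property; the pole locations and the function names are assumptions for illustration.

```python
def gains_from_poles(p1, p2, p3):
    """Coefficients of (s - p1)(s - p2)(s - p3) = s^3 + a2 s^2 + a1 s + a0."""
    a2 = -(p1 + p2 + p3)
    a1 = p1 * p2 + p1 * p3 + p2 * p3
    a0 = -p1 * p2 * p3
    return a0, a1, a2


def is_hurwitz(a0, a1, a2):
    """Routh-Hurwitz test for a monic third-order polynomial."""
    return a0 > 0 and a1 > 0 and a2 > 0 and a2 * a1 > a0


def stabilising_input(ddq_d, dq_d, q_d, dq, q, int_e, gains):
    """Control law (35); int_e is the integral of (q_d - q)."""
    a0, a1, a2 = gains
    return ddq_d + a2 * (dq_d - dq) + a1 * (q_d - q) + a0 * int_e


gains = gains_from_poles(-10.0, -10.0, -10.0)   # assumed pole locations
print(gains, is_hurwitz(*gains))
```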



3.4 Backstepping control 

The first step of the backstepping control design (Khalil, 1996) involves the definition of the
tracking error variable for each drive i = {1, 2},

$$
e_{i1} = q_{id} - q_i \;\Rightarrow\; \dot{e}_{i1} = \dot{q}_{id} - \dot{q}_i. \tag{36}
$$

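Next, a first Lyapunov function $V_{i1}$ is introduced

$$
V_{i1}(e_{i1}) = \frac{1}{2} e_{i1}^2 \;\Rightarrow\; \dot{V}_{i1}(e_{i1}) = e_{i1} \cdot \dot{e}_{i1} = e_{i1} \cdot (\dot{q}_{id} - \dot{q}_i) \overset{!}{=} -c_1 \cdot e_{i1}^2 \tag{37}
$$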

and the expression for its time derivative is solved for the virtual control input 

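$$
\dot{e}_{i1} = \dot{q}_{id} - \dot{q}_i = -c_1 \cdot e_{i1} \;\Rightarrow\; \dot{q}_i = \alpha_{i1}(e_{i1}, \dot{q}_{id}) = \dot{q}_{id} + c_1 \cdot e_{i1}. \tag{38}
$$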

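In the second step, the error variable $e_{i2}$ is defined in the following form

$$
e_{i2} = \alpha_{i1}(e_{i1}, \dot{q}_{id}) - \dot{q}_i = \dot{q}_{id} - \dot{q}_i + c_1 \cdot e_{i1} \;\Rightarrow\; \dot{e}_{i1} = e_{i2} - c_1 \cdot e_{i1} \tag{39}
$$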

and its time derivative is computed 

$$
\dot{e}_{i2} = \ddot{q}_{id} - \ddot{q}_i + c_1 \cdot \dot{e}_{i1} = \ddot{q}_{id} - \ddot{q}_i + c_1 \cdot (e_{i2} - c_1 \cdot e_{i1}). \tag{40}
$$

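Now, a second Lyapunov function $V_{i2}$ is specified.

$$
V_{i2}(e_{i1}, e_{i2}) = \frac{1}{2} e_{i1}^2 + \frac{1}{2} e_{i2}^2 > 0 \;\Rightarrow\; \dot{V}_{i2}(e_{i1}, e_{i2}) = e_{i1} \cdot \dot{e}_{i1} + e_{i2} \cdot \dot{e}_{i2} \tag{41}
$$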

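The corresponding time derivative

$$
\dot{V}_{i2}(e_{i1}, e_{i2}) = -c_1 \cdot e_{i1}^2 + e_{i2} \cdot \left[ \ddot{q}_{id} - v_i + c_1 \cdot (e_{i2} - c_1 \cdot e_{i1}) + e_{i1} \right] \overset{!}{=} -c_1 \cdot e_{i1}^2 - c_2 \cdot e_{i2}^2 \tag{42}
$$

can be made negative definite by choosing the stabilizing control input as follows

$$
v_i = \ddot{q}_i = \ddot{q}_{id} + e_{i1} \cdot (1 - c_1^2) + e_{i2} \cdot (c_1 + c_2). \tag{43}
$$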

Backstepping control design offers several advantages in comparison to flatness-based
control. It becomes possible to avoid cancellations of useful, i.e. stabilizing, nonlinearities.
Furthermore, different positive definite functions can be used in the control design, e.g.
allowing for nonlinear damping.
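
As a plausibility check of the backstepping design, the following sketch evaluates the control law (43) and the resulting time derivative of the second Lyapunov function for arbitrary states. It assumes, as in the derivation above, that the angular acceleration equals the stabilizing input v_i; the function names and the numerical values used in any call are purely illustrative.

```python
def backstepping_input(qdd_d, qd_d, q_d, qd, q, c1, c2):
    """Control law of Eq. (43) together with the two error variables."""
    e1 = q_d - q                      # Eq. (36)
    e2 = qd_d - qd + c1 * e1          # Eq. (39)
    v = qdd_d + e1 * (1.0 - c1**2) + e2 * (c1 + c2)
    return v, e1, e2


def lyapunov_rate(qdd_d, qd_d, q_d, qd, q, c1, c2):
    """Time derivative of V_i2 under the assumption q'' = v."""
    v, e1, e2 = backstepping_input(qdd_d, qd_d, q_d, qd, q, c1, c2)
    e1_dot = e2 - c1 * e1             # Eq. (39)
    e2_dot = qdd_d - v + c1 * e1_dot  # Eq. (40) with q'' = v
    return e1 * e1_dot + e2 * e2_dot, e1, e2
```

For any state, the returned rate should coincide with -c1*e1^2 - c2*e2^2, which is negative definite for positive c1 and c2.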

3.5 Sliding-mode control 

For sliding-mode control (Sira-Ramirez & Llanes-Santiago, 1995), the vector of tracking
errors is considered

$$
\mathbf{z}_i = \begin{bmatrix} q_{id} - q_i \\ \dot{q}_{id} - \dot{q}_i \end{bmatrix}. \tag{44}
$$

Based on this error vector $\mathbf{z}_i$, the following sliding surfaces $s_i$ are defined for each drive
i = {1, 2}

$$
s_i(\mathbf{z}_i) = \dot{q}_{id} - \dot{q}_i + \beta_{i1} \cdot (q_{id} - q_i), \tag{45}
$$

where $\beta_{i1}$ represents a positive gain. The convergence to the corresponding sliding surface is
achieved by introducing a discontinuous switching function in the time derivative of a
quadratic Lyapunov function



$$
V_i(s_i) = \frac{1}{2} s_i^2, \qquad \dot{V}_i(s_i) = s_i \cdot \dot{s}_i = -\alpha_i \cdot s_i \cdot \mathrm{sign}(s_i), \tag{46}
$$



with a properly chosen coefficient $\alpha_i$ that dominates the remaining model uncertainties. The
control design offers flexibility as regards the choice of the sliding surfaces and the reaching
laws. For the implementation, however, a smooth switching function is preferred to reduce
high-frequency chattering. This results in the following stabilizing control law, which leads
to a real sliding mode within a boundary layer

$$
v_i = \ddot{q}_i = \ddot{q}_{id} + \beta_{i1} \cdot (\dot{q}_{id} - \dot{q}_i) + \alpha_i \cdot \tanh\!\left(\frac{s_i}{\varepsilon}\right). \tag{47}
$$
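
The effect of the smooth switching function in (47) can be sketched numerically. The example below is only an illustration under stated assumptions: each drive is treated as a double integrator with an additional constant, unknown disturbance d (with |d| smaller than the switching gain), the desired trajectory is a constant set point, and all numerical values are hypothetical.

```python
import math


def sliding_mode_input(qdd_d, qd_d, q_d, qd, q, beta, alpha, eps):
    """Control law of Eq. (47) and the sliding variable of Eq. (45)."""
    s = (qd_d - qd) + beta * (q_d - q)
    v = qdd_d + beta * (qd_d - qd) + alpha * math.tanh(s / eps)
    return v, s


def simulate(beta=10.0, alpha=5.0, eps=0.1, d=1.0, dt=1e-4, t_end=3.0):
    """Assumed plant q'' = v + d with constant disturbance d, set point q_d = 1."""
    q, qd, s = 0.0, 0.0, 0.0
    for _ in range(int(t_end / dt)):
        v, s = sliding_mode_input(0.0, 0.0, 1.0, qd, q, beta, alpha, eps)
        qd += (v + d) * dt   # explicit Euler step
        q += qd * dt
    return s, 1.0 - q        # last sliding variable and final position error
```

In this sketch the sliding variable does not reach zero exactly but stays inside the boundary layer of width eps, which is the price paid for replacing the sign function by tanh.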



The implemented control structure is depicted in Fig. 8. The desired trajectories are
provided by an offline trajectory planning module that calculates time-optimal trajectories
according to both state constraints and input constraints. This is achieved by proper
time-scaling of polynomial functions with free parameters as described in (Aschemann &
Hofer, 2005).
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Fig. 8. Implementation of the decoupling control structure. 



4. Disturbance observer design 

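The observer provides a vector $\hat{\mathbf{x}}_2$ of estimated disturbance torques that accounts for both
model uncertainties and nonlinear friction. The main idea consists in the extension of the
system state equations with the measurable state vector

$$
\mathbf{y} = \mathbf{x}_1 = [\, \mathbf{q}, \dot{\mathbf{q}} \,]^T
$$

by two integrators, which serve as disturbance models (Aschemann et al., 2007)

$$
\dot{\mathbf{y}} = \mathbf{f}(\mathbf{y}, \mathbf{x}_2, \mathbf{u}), \quad \dim(\mathbf{y}) = 4, \qquad
\dot{\mathbf{x}}_2 = \mathbf{0}, \quad \dim(\mathbf{x}_2) = 2. \tag{48}
$$

The reduced-order disturbance observer according to (Friedland, 1996) is given by

$$
\dot{\mathbf{z}} = \boldsymbol{\Phi}(\mathbf{y}, \hat{\mathbf{x}}_2, \mathbf{u}), \quad \dim(\mathbf{z}) = 2, \tag{49}
$$

$$
\hat{\mathbf{x}}_2 = \mathbf{H} \mathbf{y} + \mathbf{z}, \tag{50}
$$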



where H denotes the observer gain matrix and z the observer state vector. The observer gain 
matrix is chosen as follows 



$$
\mathbf{H} = \mathbf{H}(h_{11}, h_{22}) \in \mathbb{R}^{2 \times 4}, \tag{51}
$$



involving only two design parameters $h_{11}$ and $h_{22}$. Aiming at asymptotically stable
observer dynamics

$$
\lim_{t \to \infty} \mathbf{e} = \lim_{t \to \infty} (\mathbf{x}_2 - \hat{\mathbf{x}}_2) = \mathbf{0}, \tag{52}
$$

the observer gains are determined by pole placement based on a linearization using the
corresponding Jacobian (Friedland, 1996). Fig. 9 shows a comparison of the simulated disturbance
forces and the observed forces provided by the proposed disturbance observer.
Here, the resulting tangential force at the pulley with radius r is depicted, which is related to
the disturbance torque according to $F_{iU} = \tau_{iU} / r$. The simulated disturbance
forces are reconstructed with high accuracy.
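
The principle of the reduced-order observer (49), (50) can be sketched for a strongly simplified scalar case. The following example is an assumption-laden illustration, not the observer used for the robot: a single axis with unit inertia, measured angular velocity y with dy/dt = u + tau, a constant disturbance torque tau, and a scalar gain h. Choosing the observer state equation as dz/dt = -h (u + h y + z) yields the estimation error dynamics de/dt = -h e, i.e. an observer pole at -h.

```python
def observer_step(z, y, u, h, dt):
    """One explicit Euler step of z' = -h * (u + h*y + z); returns (z_next, tau_hat)."""
    tau_hat = h * y + z                 # scalar analogue of Eq. (50)
    z_next = z + dt * (-h * (u + tau_hat))
    return z_next, tau_hat


def simulate(tau=2.0, h=20.0, dt=1e-3, t_end=1.0):
    """Assumed plant y' = u + tau (unit inertia) with unknown constant tau."""
    y, z, tau_hat = 0.0, 0.0, 0.0
    for _ in range(int(t_end / dt)):
        u = 0.0                         # no control input in this sketch
        z, tau_hat = observer_step(z, y, u, h, dt)
        y += dt * (u + tau)             # plant update
    return tau_hat
```

In this sketch the estimate converges to the true disturbance with the time constant 1/h; for the nonlinear robot model the gains would instead follow from the linearization mentioned above.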
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Fig. 9. Comparison of simulated disturbance force and observed disturbance force using the 
reduced-order disturbance observer. 
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5. Simulation results 

The efficiency of the proposed cascade control structure is investigated using the desired
trajectory shown in Fig. 10, with maximum velocities of approx. 0.9 m/s and maximum
accelerations of approx. 7 m/s² for each axis.

The first part of the desired trajectory involves the motion on a quarter-circle with a radius of
0.2 m from the starting point (x = 0 m, z = 1 m) to the point (x = -0.2 m, z = 0.8 m). The next
three movements consist of straight lines: the second part comprises a diagonal movement
in the xz-plane to the point (x = -0.1 m, z = 0.6 m), followed by a straight-line motion in
x-direction to the point (x = 0.1 m, z = 0.6 m). The fourth part is given by a diagonal movement
to the point (x = 0.2 m, z = 0.8 m). The fifth part involves the return motion on a quarter-circle
to the starting point (x = 0 m, z = 1 m).
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Fig. 10. Desired trajectory in the workspace. 
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Fig. 11 shows a comparison of the resulting tracking errors in the workspace for flatness- 
based control (FB), backstepping control (BS) and sliding-mode control (SM). Without 
observer-based disturbance compensation, the best results are obtained using sliding-mode 
control. 

The efficiency of the observer-based disturbance compensation is emphasized by Fig. 12. For
all considered control approaches, a further improvement of the tracking accuracy is achieved.

6. Conclusion 

In this contribution, a cascaded trajectory control based on differential flatness is presented
for a parallel robot with two degrees of freedom driven by pneumatic muscles. The
modelling of this mechatronic system leads to a system of nonlinear differential equations of
eighth order. For the characteristics of the pneumatic muscles, polynomials serve as good
approximations. The inner control loops of the cascade involve a flatness-based control of
the internal muscle pressure with high bandwidth. For the outer control loop, three different
control approaches have been investigated, leading to a decoupling of the crank angles and
the mean pressures as controlled variables. Simulation results emphasize the excellent
closed-loop performance with maximum position errors of approx. 1 mm during the
movements, vanishing steady-state position error and a steady-state pressure error of less
than 0.03 bar, which have been confirmed by first experimental results on a prototype
system.
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Fig. 12. Tracking errors in the workspace with observer-based disturbance compensation. 
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1. Introduction 

Recent developments in robotics have revealed a strong demand for autonomous outdoor
vehicles capable of some degree of self-sufficiency. These vehicles have many applications in
a large variety of domains, from space exploration to material handling, and from military
tasks to people transportation (Azouaoui & Chohra, 1998; Hong et al., 2002; Kujawski, 1995;
Labakhua et al., 2006; Niegel, 1995; Schafer, 2005; Schilling & Jungius, 1995; Wagner, 2006).
Most mobile robot missions include autonomous navigation. Thus, vehicle designers seek
to create dynamic systems able to navigate and achieve intelligent, human-like behaviors in
real dynamic environments where conditions are difficult.

In this context, small automated, non-polluting vehicles have been developed in recent years
to perform public urban transportation tasks. These vehicles must use advanced
control techniques for navigation in dynamic environments, especially urban ones. Indeed,
several research works have recently emerged to address this transportation problem. For
instance, the work developed in (Gu & Hu, 2002) presents a path-tracking scheme based on
wavelet neural predictive control that models the non-linear kinematics of the robot in order
to adapt it to a large operating range. In (Mendes et al., 2003), a path-tracking controller with
an anti-collision behavior for a car-like robot is presented. It is based on navigation and
anti-collision systems. The first system uses Fuzzy Logic (FL) to implement the path tracking,
while the second estimates the trajectories and behavior of surrounding objects.
Another work, developed in (Bento & Nunes, 2004), also treats the path-following problem of
a cybernetic car. The developed controller, guided by magnetic markers, is based on FL
and integrates an anti-collision behavior applied to a bi-steerable vehicle. Other works use
visual control to achieve a desired task, such as the work proposed in (Avina Cervantes,
2005), which develops a vision-based navigation method for mobile robots using an
on-board color camera. The objective is the use of vehicles in agriculture that navigate
automatically on a network of roads (to go from a farm to a given field, for example).
Although several investigations on the robot navigation problem have been carried out
(Avina Cervantes, 2005; Azouaoui & Chohra, 2002; Chohra et al., 1998; Gu & Hu, 2002;
Kujawski, 1995; Labakhua et al., 2006; Mendes et al., 2003; Niegel, 1995; Schilling & Jungius,
1995; Sorouchyari, 1989), further efforts are still needed to understand
the navigation behavior of a vehicle evolving in partially structured and partially known
environments such as urban ones.

In this paper, the neural-based navigation approach suggested in (Azouaoui &
Chohra, 2002; Chohra et al., 1998) is applied, with some modifications, to the bi-steerable
mobile robot Robucar. This approach relies on basic behaviors which are fused
under a neural paradigm using the Gradient Back-Propagation (GBP) learning algorithm. The
navigation is then implemented within a behavioral architecture because of its excellent
real-time execution properties (Murphy, 2001).

The aim of this work is to implement a neural-based navigation approach able to provide
the Robucar with more autonomy, intelligence, and real-time processing capabilities. In fact,
the vehicle relies on its interaction with its environment to extract useful information. First,
the neural navigation approach, essentially based on pattern classification (or
recognition) (Welstead, 1994) of target localization and obstacle avoidance behaviors, is
presented. This approach was developed in (Chohra et al., 1998) for five (05) possible
movements of vehicles, while in this paper this number is reduced to three (03) possible
movements due to the Robucar structure. Second, simulation results of the neural-based
navigation are presented. Finally, an implementation of the neural-based navigation on a
real bi-steerable robot Robucar is given, leading to a learning vehicle able to behave
intelligently when faced with unexpected situations.

2. Neural-based navigation approach applied to a bi-steerable mobile robot
Robucar in partially structured environments

To navigate in partially structured environments, the Robucar must reach its target without 
collisions with possibly encountered obstacles. In other terms, it must have the capability to 
achieve the target localization, obstacle avoidance, and decision-making and action 
behaviors. These behaviors are acquired using multilayer feedforward Neural Networks 
(NN). 

This neural navigation is built of three (03) phases as shown in Figure 1. During phase 1,
the robot learns to recognize the target location situations T_j1 (j1 = 1, ..., 5) from the
temperature field vector Xt (NN1 classifier), while during phase 2 it learns to recognize the
obstacle avoidance situations O_j2 (j2 = 1, ..., 6) from the distance vector Xo (NN2 classifier).
Phase 3 decides an action A_i (i = 1, ..., 3) from two (02) association stages and their
coordination, carried out by reinforcement trial and error learning.



[Figure 1: block diagram. Phase 1: Target Localization (NN1 Classifier), input Xt, output T. Phase 2: Obstacle Avoidance (NN2 Classifier), output O. Phase 3: Decision-Making and Action (NN3), with two Association stages and their Coordination.]

Fig. 1. Neural navigation system synopsis.
2.1 Vehicle and sensor

a) Vehicle. 

The Robucar is a non-holonomic robot characterized by its bounded steering angle
(-18° < φ < +18°) and velocity (0 m/s < v < 5 m/s) (Figure 2(a)). Three movements of the
Robucar are defined to ensure safe displacement in the environment. Consequently, three (03)
possible actions are defined: the action to move left (towards 18°), the action to move forward
(towards 0°), and the action to move right (towards -18°), as shown in Figure 2(b). They are
expressed by the action vector A = [A_1, A_2, A_3].




(a) Robucar. (b) Robot model.

Fig. 2. Robucar and its sensor.
b) Sensor. 

The perception system is essentially based on a laser range finder LMS200 (SICK, 2001). It
provides either 100° or 180° coverage with 0.25°, 0.5°, or 1.0° angular resolution. In this
paper, the overall coverage area is divided into three sub-areas corresponding to the three
possible actions as shown in Figure 2. Thus, to detect possibly encountered obstacles, three
(03) areas are defined to get the vehicle-obstacle distances: from 45° to 81°, from 81° to 99°, and
from 99° to 135° (see Figure 2). These areas are deduced from the Robucar dimensions to
ensure its safety.
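
The division of the laser scan into three areas can be sketched as follows. This is a hypothetical helper, not the Robucar software: the scan format (pairs of beam angle in degrees and measured distance), the treatment of the sector boundaries, and the naming of the sectors as right, front and left are assumptions made for illustration; only the angular limits are taken from the text.

```python
import math

# Angular sectors in degrees (limits from the text); the names are an assumption.
SECTORS = (("right", 45.0, 81.0), ("front", 81.0, 99.0), ("left", 99.0, 135.0))


def sector_min_distances(scan):
    """Return the minimal distance per sector for (angle_deg, distance) pairs."""
    d_min = {name: math.inf for name, _, _ in SECTORS}
    for angle, dist in scan:
        for name, lo, hi in SECTORS:
            # Half-open sectors; the last sector also includes 135 degrees.
            if lo <= angle < hi or (angle == hi == 135.0):
                d_min[name] = min(d_min[name], dist)
    return d_min


# Hypothetical scan: the beam at 30 degrees lies outside all sectors and is ignored.
example = sector_min_distances([(50, 4.0), (60, 2.5), (90, 7.0), (120, 1.2), (30, 0.5)])
```

A sector without any beam keeps the value infinity, which can be read as "no obstacle detected" in that direction.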



2.2 Neural-based navigation system 

During the navigation, the vehicle must build an implicit internal map (i.e., target, obstacles,
and free spaces) allowing recognition of both target location and obstacle avoidance
situations. Then, it decides the appropriate action from two association stages and their
coordination (Chohra et al., 1998; Sorouchyari, 1989). To achieve this, the neural-based
navigation system is used, where the only known data are the initial and final (i.e., target)
positions of the vehicle.
a) Phase 1. 

Target Localization (NN1 Classifier). The target localization behavior is based on the NN1
classifier, trained by the GBP algorithm, which must recognize, after learning, five (05) target
location situations from data obtained by computing the distance and orientation of the
robot relative to the target using a temperature field method (Sorouchyari, 1989). This method
models the vehicle environment as five (05) areas corresponding to all target locations, as
shown in Figure 3. These situations are defined by five (05) classes T_1, ..., T_j1, ..., T_5, where
j1 = 1, ..., 5:

If 45° < γ < 81° (Class T_1),
If 81° < γ < 99° (Class T_2),
If 99° < γ < 135° (Class T_3),
If 135° < γ < 270° (Class T_4),
If 270° < γ < 405° (Class T_5). (1)

where γ is the angle of the target direction.
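
The five target location classes of (1) can be expressed as a small lookup function. The sketch below is illustrative only: the wrapping of arbitrary angles into the interval from 45° to 405° and the assignment of the boundary angles to the upper class are assumptions, since the conditions in (1) use strict inequalities.

```python
def target_class(gamma_deg):
    """Return the class index j1 (1..5) for a target direction angle in degrees."""
    # Map any angle into [45, 405) so that class T5 covers 270..405 degrees.
    g = (gamma_deg - 45.0) % 360.0 + 45.0
    if g < 81.0:
        return 1
    if g < 99.0:
        return 2
    if g < 135.0:
        return 3
    if g < 270.0:
        return 4
    return 5
```

For instance, a target straight ahead (90°) falls into class T_2, while a target at 10° is mapped to 370° and therefore into class T_5.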



Fig. 3. Target location situations.

Temperatures in the neighborhood of the vehicle are defined by: t_L in direction 18°, t_F in
direction 0°, and t_R in direction -18°. These temperatures are computed using sine and cosine
functions as follows:

If 45° < γ < 81° (Class T_1),
Then t_R = 12 sin(γ), t_F = 6 cos(γ), t_L = 6 cos(γ),
If 81° < γ < 99° (Class T_2),
Then t_R = 6 |cos(γ)|, t_F = 12 sin(γ), t_L = 6 |cos(γ)|,
If 99° < γ < 135° (Class T_3),
Then t_R = 6 |cos(γ)|, t_F = 6 |sin(γ)|, t_L = 12 sin(γ),
If 135° < γ < 270° (Class T_4),
Then t_R = 12 |sin(γ)|, t_F = 6 |sin(γ)|, t_L = 12 |sin(γ)|,
If 270° < γ < 315° (Class T_5),
Then t_R = 12 |sin(γ)|, t_F = 6 |sin(γ)|, t_L = 6 cos(γ),
If 315° < γ < 360° (Class T_5),
Then t_R = 12 cos(γ), t_F = 6 cos(γ), t_L = 6 |sin(γ)|,
If 360° < γ < 405° (Class T_5),
Then t_R = 12 cos(γ), t_F = 6 cos(γ), t_L = 6 sin(γ). (2)
These components are pre-processed to constitute the input vector Xt of NN1 (Azouaoui &
Chohra, 2003; Azouaoui & Chohra, 2002; Chohra et al., 1998), built of an input layer, a hidden
layer, and an output layer as shown in Figure 4 (architecture of NN1), where X_i = X_Ti (i = 1, ..., 3),
Y_k (k = 1, ..., 5), C_j = T_j1 (j = j1 = 1, ..., 5).
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Fig. 4. Architecture of both NN1 and NN2. 

After learning, for each input vector Xt, NN1 provides the vehicle with the capability to decide
its target localization by recognizing the target location situation expressed by the most highly
activated output T_j1.

b) Phase 2. 

Obstacle Avoidance (NN2 Classifier). The obstacle avoidance behavior is based on the NN2
classifier, trained by GBP, which must recognize, after learning, obstacle avoidance situations
from laser sensor data giving robot-obstacle distances. These obstacle avoidance situations
are modeled as a human perceives them, that is, as topological situations: corridors,
rooms, right turns, etc. (Anderson, 1995; Azouaoui & Chohra, 2003).
The possible movements of the Robucar lead us to structure the possibly encountered obstacles
into six (06) topological situations as shown in Figure 5. These situations are defined by six
(06) classes O_1, ..., O_j2, ..., O_6, where j2 = 1, ..., 6.
The robot-obstacle minimal distances are defined in the vehicle neighborhood by: d_L in
direction 18°, d_F in direction 0°, and d_R in direction -18°, as shown in Figure 6. These
components are pre-processed to constitute the input vector Xo of NN2, built of an input layer,
a hidden layer, and an output layer as shown in Figure 4 (architecture of NN2), where
X_i = X_Oi (i = 1, ..., 3), Y_k (k = 1, ..., 6), C_j = O_j2 (j = j2 = 1, ..., 6).
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Fig. 5. Obstacle avoidance situations. 

After learning, for each input vector Xo, NN2 provides the vehicle with the capability to decide
its obstacle avoidance by recognizing the obstacle avoidance situation expressed by the most
highly activated output O_j2.
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Fig. 6. Laser range areas for obstacle detection. 

c) Phase 3. 

Decision-Making and Action (NN3). Two (02) association stages between each behavior
(target localization and obstacle avoidance) and the favorable actions (among the possible
actions), as well as the coordination of these association stages, are carried out by NN3. Thus,
the situations T_j1 and O_j2 are each associated separately with the favorable actions by
reinforcement trial and error learning, as suggested in (Sorouchyari, 1989). Afterwards, the
coordination of the two (02) association stages allows the decision-making of the appropriate
action.
NN3 is built of two layers (input layer and output layer), illustrated in Figure 7.
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1) Input Layer. 

This input layer has eleven (11) input nodes receiving the components of the T_j1
and O_j2 vectors, and transmits these inputs to all nodes of the next layer. Each node T_j1
is connected to all nodes A_i with the connection weights U_ij1, and each node O_j2 is connected
to all nodes A_i with the connection weights V_ij2, as shown in Figure 7.

2) Output Layer. 

This output layer has three (03) output nodes, whose values are obtained by adding the
contribution of each behavior. The Robucar learns through trial and error interactions with
the environment. It learns a given behavior by being told how well or how badly it is
performing as it acts in each given situation. As feedback, it receives a single information
item from the environment. The feedback is interpreted as a positive or negative scalar
reinforcement. The goal of the learning system is to maximize positive reinforcement
(reward) and/or minimize negative reinforcement (punishment) over time (Sorouchyari,
1989; Sutton & Barto, 1998). By successive trials and/or errors, the Robucar determines a
mapping function (see Figure 8) which is used for its navigation. The two association stages
are obtained as developed in (Chohra et al., 1998).

After learning, NN3 provides the vehicle with the capability to decide the appropriate action,
expressed by the most highly activated output A_i.
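
The decision stage can be sketched as a weighted sum followed by a selection of the most activated output. The example below is only a minimal illustration: the weight values in U and V, the meaning attached to the obstacle classes, and the use of one-hot situation vectors are hypothetical and are not the learned matrices of the Robucar. The larger magnitude of the obstacle weights is chosen here so that obstacle avoidance dominates target localization.

```python
def nn3_outputs(T, O, U, V):
    """Output activations A_i = sum_j1 U[i][j1]*T[j1] + sum_j2 V[i][j2]*O[j2]."""
    return [
        sum(u * t for u, t in zip(U[i], T)) + sum(v * o for v, o in zip(V[i], O))
        for i in range(len(U))
    ]


def select_action(T, O, U, V):
    """Index (1..3) of the most highly activated output node."""
    a = nn3_outputs(T, O, U, V)
    return max(range(len(a)), key=a.__getitem__) + 1


# Hypothetical weights: rows are actions A1..A3, columns are T1..T5 and O1..O6.
U = [[0.2, -0.1, 1.0, 0.8, -0.5],
     [0.3, 1.0, 0.2, -0.6, -0.6],
     [1.0, -0.1, -0.2, -0.5, 0.8]]
V = [[0.0, 2.0, 1.0, -3.0, 0.0, 0.0],
     [0.0, -3.0, 1.0, 1.0, 0.0, 0.0],
     [0.0, 1.0, -3.0, 2.0, 0.0, 0.0]]

T2 = [0, 1, 0, 0, 0]      # target recognized in class T2
O1 = [1, 0, 0, 0, 0, 0]   # hypothetical "no constraint" obstacle situation
O2 = [0, 1, 0, 0, 0, 0]   # hypothetical situation that dissociates action A2
```

With these assumed weights, `select_action(T2, O1, U, V)` selects action 2, whereas `select_action(T2, O2, U, V)` selects action 1 because the obstacle contribution outweighs the target contribution.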

3. Simulation results 

In this section, at first the training processes of NN1, NN2, and NN3 are described. Second, 
the simulated neural-based navigation is described and simulation results are presented. 



[Figure 7: two-layer network. Input layer: nodes T_1, ..., T_5 and O_1, ..., O_6. Output layer: nodes A_1, A_2, A_3. Connection weights U_ij1 and V_ij2, with i = 1, ..., 3, j1 = 1, ..., 5, j2 = 1, ..., 6.]

Fig. 7. Architecture of NN3.
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3.1 Training of NN1, NN2, and NN3 

a) Training of NN1. The training set used is composed of one hundred and nine (109)
patterns corresponding to the five (05) target location situations. The NN1 classifier
converges to the tolerance E_a1 = 0.06 with the learning rate η_1 = 0.1.

b) Training of NN2. The training set used is composed of one hundred and fourteen (115)
patterns corresponding to the six (06) obstacle avoidance situations. The NN2 classifier
converges to the tolerance E_a2 = 0.16 with the learning rate η_2 = 0.4.

c) Training of NN3. This training is achieved with the training of two association stages and 
their coordination. 

1) Association. 

In this stage, the training to obtain the weights U_ij1 and V_ij2, which constitutes the training
of NN3, is carried out in an obstacle-free environment (i.e., O = 0) for the target
localization behavior and without considering the temperature field (i.e., T = 0) for the
obstacle avoidance behavior, respectively.
The training results are illustrated in Figure 8, where the weights U_ij1 and V_ij2 are adjusted to
obtain the reinforced actions among the favorable actions. The matrices of the two behaviors are
represented in this figure: solid circles correspond to positive weights, which represent
favorable actions and indicate a reinforced association; the values are proportional to the
area of the circles, and the most reinforced action is the one with the greatest positive weight.
Hollow circles correspond to negative weights, which represent dissociated actions.
The choice of the most reinforced action is guided by the principle that, for the obstacle
avoidance behavior, the vehicle must avoid obstacles only as far as needed to avoid collisions,
and, for the target localization behavior, it must take the straightest action towards its target.

(a) Target localization matrix. (b) Obstacle avoidance matrix. Legend: solid circle = reinforced action, hollow circle = dissociated action.

Fig. 8. Association matrices.
2) Coordination. 

The coordination of the two association stages is governed by the rule that actions
generated by obstacle avoidance have precedence over those generated by target
localization. In fact, the detection of the maximum temperature is interpreted as the goal of
the vehicle, while the actions generated by the presence of obstacles are interpreted as the
reflex of the vehicle.



3.2 Simulation of the neural-based navigation on the Robucar 

To reflect the vehicle behaviors acquired by learning, the Robucar navigation is simulated in
different static and dynamic partially structured environments. The simulated vehicle knows
only two data: its initial and final (target) positions. From these data, it must reach
its target while avoiding possibly encountered obstacles using the neural-based navigation
approach.

Tested in the environment of Figure 9, corresponding to a corridor of our centre CDTA
(Centre de Developpement des Technologies Avancees), the vehicle succeeds in avoiding walls
and obstacles by choosing the appropriate action, steering right or left according to the
given situation, as shown in Figure 10, where the evolution of v and φ is given. At point A, it
stops because it finds itself in a blocked situation (walls on both sides and an obstacle in front).



[Figure 9: corridor map showing the Robucar initial position (O), the obstacles, the robot trajectory, and the blocked way.]

Fig. 9. Corridor environment 1 with a blocked way.

The robot at point O in Figure 11 cannot reach its target directly because of the wall on its
right, so it goes straight until it finds a way to turn right (Figure 11 at point A and Figure 12 at
time 1 s). Afterwards, it goes straight ahead until point B, where it steers to reach its target.
Fig. 10. Evolution of steering angle and velocity in corridor environment 1.

Fig. 11. Corridor environment 2.

Fig. 12. Evolution of steering angle and velocity in corridor environment 2.

The oscillations in Figure 12(a) are due to the fact that the robot tries to point to its target
each time it does not detect obstacles, but the wall pushes it away until point B.

4. Experiments of the neural-based navigation on the Robucar 

In this section, experimental results are given for different environments. 

Figure 13 corresponds to the confined corridor of CDTA. Obstacle A is placed to force the robot
to steer right, and obstacle B to block the way.
The experiments show that the Robucar behaves intelligently, since it moves while avoiding
collisions with the walls (wall 1 and wall 2) and obstacle A. At this point, it turns right and goes
forward until it detects obstacle B and stops because it finds itself in a dead zone. Note that
the same behavior has been observed in the simulation of Figure 9.
Figure 14(a) gives the trajectory of the Robucar, which corresponds to the corridor
configuration. The evolution of the steering angle and velocity is illustrated in Figure 14(b)
and (c).
The example of Figure 15 shows the Robucar moving in a dynamic partially structured
environment. It avoids the obstacles and reaches its target as shown in Figure 16. At point
(c), an obstacle is suddenly put in the robot's trajectory to cause a dead zone and is removed
after a while (see Figure 16(c), where the velocity between 14 s and 19 s is equal to zero). The
robot stops when it detects the obstacle, restarts when the obstacle is taken away, and
reaches its target at location (5 m, -5 m).
Fig. 13. Internal environment.

Fig. 14. Robucar trajectory and evolution of steering angle and velocity (internal
environment).

Fig. 15. External environment.

Fig. 16. Robucar trajectory and evolution of steering angle and velocity (external
environment).

5. Conclusion 

In the implemented neural-based navigation, the two intelligent behaviors necessary for
navigation are acquired by learning using the GBP algorithm. They enable the Robucar to be more
autonomous and intelligent in partially structured environments. Nevertheless, a
number of issues need to be investigated further. First, the Robucar must be
endowed with one or several backward actions so that it does not remain stopped in a dead zone
situation. Another interesting alternative is the use of a better localization, based not only on
odometry but also on the fusion of data from other sensors such as the laser scanner.
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1. Introduction 

In recent years, the problem of determining the asymptotic stability region of autonomous
nonlinear dynamic systems has been addressed in many research works. Many methods are
based on approaches using Lyapunov candidate functions (Davidson & Kurak,
1971) and (Tesi et al., 1996), which yield a sufficient stability region around an
equilibrium point. In particular, the method of Zubov (Zubov, 1962) is a vital contribution. In
fact, it provides necessary and sufficient conditions characterizing the regions
of asymptotic stability around stable equilibrium points.

Such a technique was applied for the first time by Margolis (Margolis & Vogt, 1963) to
second-order systems. Moreover, a numerical approach to the method was handled by
Rodden (Rodden, 1964), who suggested a numerical solution for the determination of an
optimum Lyapunov function. Some applications of the latter method to nonlinear models of
electrical machines were also presented in the literature (Willems, 1971), (Abu Hassan &
Storey, 1981), (Chiang, 1991) and (Chiang et al., 1995). In the same direction, the work
presented in (Vanelli & Vidyasagar, 1985) deals with the problem of maximizing
Lyapunov candidate functions to obtain the widest domain of attraction around
equilibrium points of autonomous nonlinear systems. Burnand and Sarlos (Burnand &
Sarlos, 1968) have presented a method for constructing the region of attraction using the
Zubov method.

All these methods of estimating or enlarging the stability region of nonlinear dynamic
systems, called Lyapunov methods, are based either on the characterization of necessary
and sufficient conditions for the optimization of Lyapunov candidate functions, or on
approaches using Zubov's digital theorem. However, they also
have some limitations that prevent obtaining the exact asymptotic stability domain of the
considered systems. Several other approaches, which use neither Lyapunov
functions nor Zubov's method, have therefore been dealt with in recent research.
Among these works are those based on topological considerations of the stability
regions (Benhadj Braiek et al., 1995), (Genesio et al., 1985) and (Loccufier & Noldus, 2000).
Indeed, the first methods, based on optimization approaches and on LaSalle's
considerations, have been developed to ensure a practical continuous stability
region for second-order systems. Furthermore, other methods, based on geometric
interpretations of the model equations, have drawn increasing attention to the equivalence
between the convergence of the linear part of the autonomous nonlinear system model and
the existence of closed trajectories in the plane. An interesting approach dealing with this
subject is the trajectory reversing method (Bacha et al., 1997) and (Noldus et al., 1995).
In this respect, an advanced trajectory reversing method for nonlinear polynomial
systems has been developed (Bacha et al., 2007). This approach can be formulated as a
combination of the algebraic reversing method of the recurrent equation system and
the concept of the existence of a guaranteed asymptotic stability region around an equilibrium
point. The validity of the algebraic reversing approach is improved by considering the
variation model in the neighbourhood of points determined on the boundary of the
asymptotic stability region. A clear enlargement of the final stability region is
obtained when compared with the results of other methods. This method has
been tested on some practical autonomous nonlinear systems such as the Van der Pol system.

2. Backward iteration approaches 

We extend the trajectory reversing method to discrete nonlinear systems. To this end, we 
propose two different algebraic approaches for inverting the recurrent polynomial equation 
that represents the discrete-time system under study. The size and the accuracy of the 
resulting asymptotic stability region are the main criteria for comparing the two 
approaches, which are applied to a discrete electrical system. 

2.1 Description of the studied systems 

We consider polynomial discrete-time systems described by the following recurrent state 
equation: 

$$X_{k+1} = F(X_k) = \sum_{i=1}^{r} F_i X_k^{[i]} \tag{1}$$

where $F_i$, $i = 1, \dots, r$, are $(n \times n^i)$ matrices and $X_k$ is an $n$-dimensional discrete-time 
state vector. $X_k^{[i]}$ designates the $i$-th Kronecker power of the state $X_k$. The initial state is 
denoted by $X_0$. Note that this class of polynomial systems (1) may represent various 
controlled practical processes such as electrical machines and robot manipulators. 
It is assumed that system (1) satisfies the known conditions for the existence and 
uniqueness of each solution $X(k, X_0)$ for all $k$, with initial condition $X(k=0) = X_0$. 
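
As an illustration of the state equation (1), the following Python sketch evaluates a polynomial map written with Kronecker powers. It assumes NumPy; the helper names `kron_power` and `poly_map` and the example matrices are hypothetical and do not come from the chapter.

```python
import numpy as np

def kron_power(x, i):
    """Return the i-th Kronecker power x^[i] of a 1-D vector x (x^[1] = x)."""
    out = x
    for _ in range(i - 1):
        out = np.kron(out, x)
    return out

def poly_map(F, x):
    """Evaluate F(x) = sum_i F[i-1] @ x^[i], where F[i-1] has shape (n, n**i)."""
    return sum(Fi @ kron_power(x, i) for i, Fi in enumerate(F, start=1))

# Illustrative example with n = 2 and r = 2 (matrices chosen for the demo only)
F1 = np.array([[0.5, 0.0], [0.0, 0.5]])
F2 = np.zeros((2, 4))
F2[1, 0] = 1.0                     # adds x1**2 to the second component
x = np.array([1.0, 2.0])
x_next = poly_map([F1, F2], x)     # one forward step of the recurrence (1)
```

For a two-dimensional state, `kron_power(x, 2)` is the vector `(x1*x1, x1*x2, x2*x1, x2*x2)`, which is why each $F_i$ needs $n^i$ columns.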

The origin is obviously an equilibrium point, which is assumed to be asymptotically stable. 
The region of asymptotic stability of the origin is defined as the set $\Omega$ of all points $X_0$ such 
that: 

$$\forall X_0 \in \Omega,\ \forall k \in \mathbb{N},\ X(k, X_0) \in \mathbb{R}^n \quad \text{and} \quad \lim_{k \to \infty} X(k, X_0) = 0 \tag{2}$$

So, we can find an open invariant set $\Omega$ with boundary $\Gamma$ such that $\Omega$ is a region of 
asymptotic stability (RAS) of system (1), defined by the property that every trajectory 
starting from $X_0 \in \Omega$ reaches the equilibrium point of the system. 
Note that determining the global stability region of a given system is a difficult task. One 
therefore often has to be satisfied with an approximation: a guaranteed stable region, all of 
whose points belong to the entire stability region. 

In the following sections, we estimate a stability region of system (1) included in the entire 
RAS. The main way to obtain a stable region $\Omega$ is the trajectory reversing method, also 
called backward iteration. 
For a discrete nonlinear system with state equation (1), backward iteration means running 
the reverse of the state equation (1), which requires making the following retrograde 
recurrent equation explicit: 

$$X_k = F^{-1}(X_{k+1}) \tag{3}$$

Note that this reverse system has the same trajectories in the discrete state space as (1). The 
asymptotic behaviour of the reversed trajectories starting in the region of asymptotic 
stability $\Omega$ is therefore related to its boundary $\Gamma$ and always provides information about it. 
To determine the inverted polynomial recurrent equation, we present in the next section 
two different numerical backward iteration approaches, using the Kronecker product form 
and the Taylor expansion. 

2.2 Proposed approaches for the formulation of the discrete model inversion 

The reverse polynomial recurrent equation (3) cannot, in general, be determined exactly. 
Most of the methods suggested for this purpose are therefore based on approximations. 

• First approach 
In the first method, we propose the following approximation: 

$$X_k = X_{k+1} + \varepsilon(X_{k+1}) \tag{4}$$

where $\varepsilon(X_{k+1})$ is assumed to be a small term, which we will make explicit. This assumption 
requires a suitable choice of the sampling period. Without loss of generality, we develop the 
expression of $\varepsilon(X_{k+1})$ for the case $r = 3$. The results can easily be generalized to any 
polynomial degree $r$ by following the same principle. 
We then consider the following recurrent equation: 

$$X_{k+1} = F_1 X_k + F_2 X_k^{[2]} + F_3 X_k^{[3]} \tag{5}$$

Replacing $X_k$ in (5) by its expression (4) and neglecting all terms $\varepsilon^{[n]}(X_{k+1})$ for $n > 1$, one 
obtains the following expression of $\varepsilon(X_{k+1})$: 

$$\varepsilon(X_{k+1}) = \Big[ F_1 + F_2 \big( X_{k+1} \otimes I_n + I_n \otimes X_{k+1} \big) + F_3 \big( X_{k+1} \otimes I_n \otimes X_{k+1} + X_{k+1}^{[2]} \otimes I_n + I_n \otimes X_{k+1}^{[2]} \big) \Big]^{-1} \Big[ X_{k+1} - \big( F_1 X_{k+1} + F_2 X_{k+1}^{[2]} + F_3 X_{k+1}^{[3]} \big) \Big] \tag{6}$$
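
The first approach can be sketched in Python as follows. This is a minimal illustration under stated assumptions, not the authors' implementation: it assumes NumPy, a cubic system ($r = 3$), and hypothetical function names `forward` and `backward_step_first`. Read this way, (4) and (6) amount to a single Newton step for solving $F(X_k) = X_{k+1}$ started from $X_k = X_{k+1}$. The example matrices use the numerical values quoted for the synchronous generator in Section 2.3, with the nonlinear coefficients assumed to sit in the first column of $F_2$ and $F_3$.

```python
import numpy as np

def forward(F1, F2, F3, x):
    """Direct model (5): F(x) = F1 x + F2 x^[2] + F3 x^[3]."""
    x2 = np.kron(x, x)
    return F1 @ x + F2 @ x2 + F3 @ np.kron(x2, x)

def backward_step_first(F1, F2, F3, y):
    """Approximate X_k from y = X_{k+1} using (4) and (6)."""
    n = y.size
    I = np.eye(n)
    c = y.reshape(n, 1)            # column form of X_{k+1}
    c2 = np.kron(c, c)             # column form of X_{k+1}^[2]
    # Bracketed matrix of (6): the Jacobian of F evaluated at X_{k+1}
    J = (F1
         + F2 @ (np.kron(c, I) + np.kron(I, c))
         + F3 @ (np.kron(np.kron(c, I), c) + np.kron(c2, I) + np.kron(I, c2)))
    eps = np.linalg.solve(J, y - forward(F1, F2, F3, y))
    return y + eps

# Example values (assumed placement of the nonlinear coefficients)
F1 = np.array([[1.0, 0.05], [-0.0458, 0.975]])
F2 = np.zeros((2, 4)); F2[1, 0] = 0.01
F3 = np.zeros((2, 8)); F3[1, 0] = 0.0076
y = np.array([0.1, 0.05])
x_prev = backward_step_first(F1, F2, F3, y)
residual = np.linalg.norm(forward(F1, F2, F3, x_prev) - y)
```

The residual measures how well the approximate predecessor maps back onto `y`; it stays small only when $\varepsilon$ is small, which is why the text requires a suitable sampling period.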



Then, the RAS may be well estimated by means of a convergent sequence of simply 
connected domains generated by the backward iterations (4) and (6). 

• Second approach 
The second proposed technique for inverting the model (1) characterizes the reverse model 

$$X_k = F^{-1}(X_{k+1}) = G(X_{k+1}) \tag{7}$$

by a polynomial vector function $G(\cdot)$ of degree $r$, i.e.: 

$$X_k = \sum_{i=1}^{r} G_i X_{k+1}^{[i]} \tag{8}$$

where $G_i$, $i = 1, \dots, r$, are matrices of dimensions $(n \times n^i)$. 
The matrices $G_i$ in (8) can be identified by writing $F(G(X_{k+1})) = X_{k+1}$, which leads to the 
following relations: 

$$G_1 = F_1^{-1}, \qquad G_2 = -F_1^{-1} F_2\, G_2^{2}, \qquad \dots, \qquad G_p = -F_1^{-1} \sum_{i=2}^{p} F_i\, G_p^{i} \tag{9}$$

where the matrices $G_p^{i}$ (the superscript is an index, not a power), for $i = 2, \dots, r$ and 
$p = 2, \dots, r$, verify the following recurrent relations: 

$$G_p^{2} = \sum_{j=1}^{p-1} \big( G_j \otimes G_{p-j} \big), \qquad G_p^{i} = \sum_{j=1}^{p-i+1} \big( G_j \otimes G_{p-j}^{\,i-1} \big) \tag{10}$$
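
For $r = 3$, relations (9) and (10) can be written out explicitly. The sketch below is an illustration under stated assumptions rather than the chapter's own code: it assumes NumPy, hypothetical function names, and the reading $G_2^2 = G_1 \otimes G_1$, $G_3^2 = G_1 \otimes G_2 + G_2 \otimes G_1$, $G_3^3 = G_1 \otimes G_1 \otimes G_1$ obtained by matching powers of $X_{k+1}$ in $F(G(X_{k+1})) = X_{k+1}$. The example matrices reuse the synchronous generator values of Section 2.3, with the nonlinear coefficients assumed to sit in the first column.

```python
import numpy as np

def inverse_coefficients(F1, F2, F3):
    """Coefficients G1, G2, G3 of the truncated polynomial inverse (8), r = 3."""
    G1 = np.linalg.inv(F1)
    G2 = -G1 @ F2 @ np.kron(G1, G1)
    G3 = -G1 @ (F2 @ (np.kron(G1, G2) + np.kron(G2, G1))
                + F3 @ np.kron(np.kron(G1, G1), G1))
    return G1, G2, G3

def apply_poly(M1, M2, M3, v):
    """Evaluate M1 v + M2 v^[2] + M3 v^[3]."""
    v2 = np.kron(v, v)
    return M1 @ v + M2 @ v2 + M3 @ np.kron(v2, v)

# Example values (assumed placement of the nonlinear coefficients)
F1 = np.array([[1.0, 0.05], [-0.0458, 0.975]])
F2 = np.zeros((2, 4)); F2[1, 0] = 0.01
F3 = np.zeros((2, 8)); F3[1, 0] = 0.0076
G1, G2, G3 = inverse_coefficients(F1, F2, F3)

y = np.array([0.1, 0.05])
x_prev = apply_poly(G1, G2, G3, y)                        # backward step (8)
residual = np.linalg.norm(apply_poly(F1, F2, F3, x_prev) - y)
```

Because $G$ is truncated at degree 3, the composition $F(G(y))$ reproduces $y$ only up to terms of order four in $y$, so the residual grows as the points move away from the origin.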



• Evolutionary algorithm of backward iteration method 
Using either of the approaches presented above for reversing the recurrent equation of 
system (1), the trajectory reversing technique can be run with the following conceptual 
algorithm. 

1. Verify that the origin equilibrium of system (1) is asymptotically stable, i.e. 
$|\lambda_i(F_1)| < 1$ for every eigenvalue $\lambda_i$ of $F_1$. 

2. Determine a guaranteed stable region (GSR), denoted $\Omega_0$, using Theorem 1 proposed in 
(Benhadj, 1996a) and presented in Section 3.2. 

3. Determine the discrete reverse model of system (1) using the first or the second 
approach. 

4. Apply the reverse model to different initial states belonging to the boundary $\Gamma_0$ of the 
GSR $\Omega_0$. 

Applying the backward iteration $k$ times to the boundary $\Gamma_0$ leads to a larger stability 
region $\Omega_k$ such that $\Omega_0 \subset \Omega_1 \subset \dots \subset \Omega_{k-1} \subset \Omega_k$. 

The performance of the backward iteration algorithm depends on which of the two 
proposed inversion techniques is used for the polynomial discrete model. 
To compare the two approaches, we next implement them on a second-order model of a 
synchronous generator. 
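
Step 4 of the algorithm above can be sketched as a loop over sampled boundary points. The Python code below is a hedged illustration for second-order systems only: it assumes NumPy, a circular initial boundary, and a hypothetical callable `inverse_step` that stands for whichever reverse model (first or second approach) is chosen. The toy inverse used in the example is not the generator model.

```python
import numpy as np

def backward_iterate_boundary(inverse_step, radius, n_points, n_iter):
    """Propagate sampled points of the circle ||X|| = radius through inverse_step.

    Returns a list of (n_points, 2) arrays: the initial boundary followed by
    its image after 1, 2, ..., n_iter backward iterations.
    """
    angles = np.linspace(0.0, 2.0 * np.pi, n_points, endpoint=False)
    pts = radius * np.column_stack((np.cos(angles), np.sin(angles)))
    history = [pts]
    for _ in range(n_iter):
        pts = np.array([inverse_step(p) for p in pts])
        history.append(pts)
    return history

# Toy check: for the linear contraction F(x) = 0.5 x the exact inverse is 2 x,
# so every backward iteration doubles the radius of the boundary.
history = backward_iterate_boundary(lambda p: 2.0 * p, radius=0.42,
                                    n_points=8, n_iter=3)
```

Plotting the successive arrays in `history` gives the nested boundaries of the growing stability region estimate.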

2.3 Simulation study 

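We consider the simplified model of a synchronous generator described by the following 
second-order differential equation (Willems, 1971): 

$$\frac{d^2\delta}{dt^2} + a \frac{d\delta}{dt} + \sin(\delta_0 + \delta) - \sin(\delta_0) = 0 \tag{11}$$

where $\delta_0$ is the power angle and $\delta$ is the power angle variation. 
The continuous state equation of the studied process, for the state vector $X = (x_1, x_2)^T$ with 
$x_1 = \delta$ and $x_2 = \frac{d\delta}{dt}$, is given by the following pair of equations: 

$$\dot{x}_1 = x_2, \qquad \dot{x}_2 = -a x_2 - \sin(x_1 + \delta_0) + \sin \delta_0 \tag{12}$$

where $a$ is the damping factor. 
This nonlinear system can be approximated by a third-degree polynomial system: 

$$\dot{X} = A_1 X + A_2 X^{[2]} + A_3 X^{[3]} \tag{13}$$

with 

$$A_1 = \begin{pmatrix} 0 & 1 \\ -\cos\delta_0 & -a \end{pmatrix}, \quad A_2 = \begin{pmatrix} 0 & 0 & 0 & 0 \\ \frac{\sin\delta_0}{2} & 0 & 0 & 0 \end{pmatrix} \quad \text{and} \quad A_3 = \begin{pmatrix} 0 & 0 & \cdots & 0 \\ \frac{\cos\delta_0}{6} & 0 & \cdots & 0 \end{pmatrix}$$

The discretization of the state equation (13) using the Newton-Raphson technique 
(Jennings & McKeown, 1992), (Bacha et al., 2006a), (Bacha et al., 2006b) with a sampling 
period $T$ leads to the following discrete state equation of the synchronous machine: 

$$X_{k+1} = F_1 X_k + F_2 X_k^{[2]} + F_3 X_k^{[3]} \tag{14}$$

with 

$$F_1 = \begin{pmatrix} 1 & T \\ -T\cos\delta_0 & 1 - aT \end{pmatrix}, \quad F_2 = \begin{pmatrix} 0 & 0 & 0 & 0 \\ \frac{1}{2} T \sin\delta_0 & 0 & 0 & 0 \end{pmatrix} \quad \text{and} \quad F_3 = \begin{pmatrix} 0 & 0 & \cdots & 0 \\ \frac{1}{6} T \cos\delta_0 & 0 & \cdots & 0 \end{pmatrix}$$

With the following parameters: $\delta_0 = 0.412$, $a = 0.5$ and $T = 0.05$, one obtains the following 
numerical values of the matrices $F_i$, $i = 1, 2, 3$: 

$$F_1 = \begin{pmatrix} 1 & 0.05 \\ -0.0458 & 0.975 \end{pmatrix}, \quad F_2 = \begin{pmatrix} 0 & 0 & 0 & 0 \\ 0.01 & 0 & 0 & 0 \end{pmatrix}, \quad F_3 = \begin{pmatrix} 0 & 0 & \cdots & 0 \\ 0.0076 & 0 & \cdots & 0 \end{pmatrix}$$

One can easily verify that the equilibrium $X_e = 0$ is asymptotically stable, since the 
eigenvalues of $F_1$ have modulus less than 1 (approximately 0.989). 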
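
The numerical matrices of the discrete generator model can be reproduced with a few lines of Python. This is a sketch assuming NumPy; it also assumes that the $x_1^2$ and $x_1^3$ coefficients occupy the first column of the second row of $F_2$ and $F_3$, which follows from the Taylor expansion of $\sin(x_1 + \delta_0)$ but is not legible in the printed matrices.

```python
import numpy as np

# Parameters quoted in the text
delta0, a, T = 0.412, 0.5, 0.05

# Discrete model matrices of equation (14)
F1 = np.array([[1.0, T],
               [-T * np.cos(delta0), 1.0 - a * T]])
F2 = np.zeros((2, 4))
F2[1, 0] = 0.5 * T * np.sin(delta0)       # coefficient of x1**2
F3 = np.zeros((2, 8))
F3[1, 0] = T * np.cos(delta0) / 6.0       # coefficient of x1**3

# Step 1 of the backward iteration algorithm: eigenvalues of F1 inside the unit disc
spectral_radius = np.max(np.abs(np.linalg.eigvals(F1)))
stable = spectral_radius < 1.0
```

The check on `spectral_radius` is the eigenvalue condition required in step 1 of the algorithm of Section 2.2.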

Our aim now is to estimate a local domain of stability of the origin equilibrium $X_e = 0$. 
To this end, we use the backward iteration technique with the proposed inversion 
algorithms of the direct system (14), applied from the boundary $\Gamma_0$ of the ball $\Omega_0$ centred at 
the origin with radius $R_0 = 0.42$, which is a guaranteed stability region (GSR) obtained by 
the method developed in (Benhadj, 1996a). 

• Domain of stability obtained by using the first approach of discrete model 
inversion: 
Implementing the first approach of discrete model inversion, described by equation (4), 
leads after 2000 iterations to the region of stability shown in figure 1. 

Figure 2 shows the stability domain of the discrete system (14) obtained after running the 
backward iteration based on the inversion model (4) 50000 times. 

The domain obtained after 50000 iterations is clearly larger than that obtained after 2000 
iterations, and it is included in the exact stability domain of the studied system, which 
confirms the validity of the first proposed approach to the backward iteration formulation. 

Fig. 1. RAS of discrete synchronous generator model obtained after 2000 backward iterations 
based on the first proposed approach. 

Fig. 2. RAS of discrete synchronous generator model obtained after 50000 backward 
iterations based on the first proposed approach. 

• Domain of stability obtained by using the second approach of discrete model 
inversion 
Applying the discrete backward iteration formulated with the reverse model (8), we obtain 
the stability domain shown in figure 3 after 1000 iterations and the domain shown in 
figure 4 after 50000 iterations. 

Figure 4 suggests that the stability domain estimated by the second approach of backward 
iteration is larger and more precise than that obtained by the first approach. The domain 
reached covers almost the entire domain of stability, which shows the efficiency of the 
second approach, particularly when the order of the studied system is low, as for a 
second-order system. 



Fig. 3. RAS of discrete synchronous generator model obtained after 1000 backward 
iterations based on the second proposed approach. 

Fig. 4. RAS of discrete synchronous generator model obtained after 50000 backward 
iterations based on the second proposed approach. 



2.4 Conclusion 

In this work, the extension of the trajectory reversing concept to the estimation of a region 
of asymptotic stability of nonlinear discrete systems has been investigated. Polynomial 
nonlinear systems have been considered in particular. 
Since the trajectory reversing method, also called backward iteration, is based on the 
inversion of the direct discrete model, two different approaches have been proposed for 
formulating the reverse of a discrete polynomial system. 
Applying the backward iteration with either approach, starting from the boundary of an 
initial guaranteed stability region, yields a significant enlargement of the estimated stability 
domain. In the particular case of second-order systems, the technique can lead to the entire 
domain of stability. 
The simulation of the developed algorithms on a second-order model of a synchronous 
generator has shown the validity of the two approximations, with a slight advantage for the 
second approach of discrete model inversion, since the RAS it yields is larger and more 
precise than the one given by the first approach. 

3. Technique of a guaranteed stability domain determination 

In this part we consider a new approach for estimating a large asymptotic stability domain 
for discrete-time nonlinear polynomial systems. Based on the Kronecker product 
(Benhadj Braiek, 1996a; Benhadj Braiek, 1996b) and the Gronwall-Bellman lemma for the 
estimation of a guaranteed region of stability, the proposed method improves on previous 
results in this field of research. 

3.1 Description of the studied systems 

We consider the discrete nonlinear systems described by a state equation of the following 
form: 

$$X(k+1) = F(X(k)) = \sum_{i=1}^{q} A_i X^{[i]}(k) \tag{15}$$

where $k$ is the discrete time variable, $X(k) \in \mathbb{R}^n$ is the state vector, $X^{[i]}(k)$ designates the 
$i$-th Kronecker power of the vector $X(k)$ and $A_i$, $i = 1, \dots, q$, are $(n \times n^i)$ matrices. The 
system (15) can also be written in the following form: 

$$X(k+1) = M(X(k))\, X(k) \tag{16}$$

where: 

$$M(X(k)) = A_1 + \sum_{j=2}^{q} A_j \big( I_n \otimes X^{[j-1]}(k) \big) \tag{17}$$

and $\otimes$ is the Kronecker product (Benhadj Braiek, 1996a; Benhadj Braiek, 1996b). 
Assumption 1: The linear part of the discrete system (15) is asymptotically stable, i.e. all the 
eigenvalues of the matrix $A_1$ have modulus less than 1. 

3.2 Guaranteed stability region 

Our purpose is to determine a sufficient domain $\Omega_0$ of initial conditions in which the 
asymptotic stability of system (15) is guaranteed, according to the following definition: 

$$\forall X_0 \in \Omega_0,\quad X(k, k_0, X_0) \in \mathbb{R}^n \ \ \forall k \quad \text{and} \quad \lim_{k \to \infty} X(k, k_0, X_0) = 0 \tag{18}$$

where $X(k, k_0, X_0)$ designates the solution of the nonlinear recurrent equation (15) with 
the initial condition $X(k_0) = X_0$. 

The stability domain that we propose is a ball of radius $R_0$ centred at the origin $X = 0$, i.e., 

$$\Omega_0 = \{ X_0 \in \mathbb{R}^n ;\ \|X_0\| < R_0 \} \tag{19}$$

The radius $R_0$ is called the stability radius of the system (15). 

A simple domain ensuring the stability of the system (15) is defined by the following 
theorem (Benhadj Braiek, 1996b). 

Theorem 1. Consider the discrete system (15) satisfying Assumption 1, and let $c$ and $\alpha$ be 
positive numbers with $\alpha \in\, ]0, 1[$ verifying 

$$\|A_1^{k-k_0}\| \le c\, \alpha^{k-k_0} \quad \forall k \ge k_0 \tag{20}$$

Then this system is asymptotically stable on the domain $\Omega_0$ defined in (19), with $R_0$ the 
unique positive solution of the following equation: 

$$\sum_{k=2}^{q} \gamma_k R_0^{k-1} - \frac{1-\alpha}{c} = 0 \tag{21}$$

where $\gamma_k$, $k = 2, \dots, q$, denote: 

$$\gamma_k = c^{k-1} \|A_k\| \tag{22}$$

Furthermore, the stability is exponential. 
Proof. The equation (15) can be written as: 

$$X(k+1) = A_1 X(k) + h(X(k))\, X(k) \tag{23}$$

with 

$$h(X(k)) = \sum_{j=2}^{q} A_j \big( I_n \otimes X^{[j-1]}(k) \big) \tag{24}$$

Let us assume that: 

$$\forall k \ge k_0,\quad \|X(k)\| \le R \tag{25}$$

then, using the matrix norm property of the Kronecker product, we have 

$$\|h(X(k))\| \le \lambda(R) \tag{26}$$

with: 

$$\lambda(R) = \sum_{j=2}^{q} \|A_j\| R^{j-1} \tag{27}$$

With $g(X) = h(X)X$, we have $\|g(X)\| \le \lambda(R)\|X\|$, so Lemma 1 (see the appendix) gives: 

$$\|X(k)\| \le c\,(\alpha + c\lambda(R))^{k-k_0}\, \|X(k_0)\| \tag{28}$$

Then, if: 

$$\lambda(R) < \frac{1-\alpha}{c} \tag{29}$$

we have $\alpha + c\lambda(R) < 1$ and $X(k)$ converges exponentially to zero. 
Now, to ensure the hypothesis (25), it is sufficient to have (from (28)): 

$$c\|X(k_0)\| < R \quad \text{or} \quad \|X(k_0)\| < R_0 = \frac{R}{c} \tag{30}$$

Taking $R$ at the limit of (29), $R_0 = R/c$ satisfies the equation (21) of Theorem 1. 
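
Equation (21) is a scalar polynomial equation with non-negative coefficients, so its positive root can be found by bisection. The following Python sketch illustrates this; it is not the authors' code, the function names `positive_root` and `stability_radius` are hypothetical, and the norms $\|A_k\|$ are passed in as numbers because the chapter does not state which matrix norm is used.

```python
def positive_root(coeffs, rhs, n_iter=200):
    """Solve sum_j coeffs[j-1] * r**j = rhs for r > 0 by bisection.

    Assumes non-negative coeffs with at least one positive entry and rhs > 0,
    so the left-hand side is increasing on r >= 0 and the root is unique.
    """
    def p(r):
        return sum(cj * r ** j for j, cj in enumerate(coeffs, start=1))
    lo, hi = 0.0, 1.0
    while p(hi) < rhs:             # grow the bracket until it contains the root
        hi *= 2.0
    for _ in range(n_iter):
        mid = 0.5 * (lo + hi)
        if p(mid) < rhs:
            lo = mid
        else:
            hi = mid
    return 0.5 * (lo + hi)

def stability_radius(A_norms, c, alpha):
    """Radius R0 of Theorem 1; A_norms = [||A_2||, ..., ||A_q||]."""
    gammas = [c ** (k - 1) * nk for k, nk in enumerate(A_norms, start=2)]  # (22)
    return positive_root(gammas, (1.0 - alpha) / c)                        # (21)

# Illustrative values only: q = 3, ||A_2|| = 0, ||A_3|| = 1, c = 1, alpha = 0.75
R0 = stability_radius([0.0, 1.0], c=1.0, alpha=0.75)
```

With these illustrative values, (21) reduces to $R_0^2 = 0.25$. Larger values of $c$ or of the norms $\|A_k\|$ shrink the guaranteed radius.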

3.3 Enlargement of the guaranteed stability region (GSR) 

Our objective in this section is to enlarge the guaranteed stability region $\Omega_0$ characterized 
in Section 3.2. To this end, we consider the boundary $\Gamma_0$ of the obtained GSR of radius $R_0$. 
Let $X'_0$ be a point belonging to $\Gamma_0$, and $X'_k$ the image of $X'_0$ after $k$ applications of the 
function $F(\cdot)$ characterizing the considered system: 

$$X'_k = F^k(X'_0) \tag{31}$$

$X'_k$ is then a point belonging to the stability domain $\Omega_0$: 

$$\|X'_k\| < R_0, \quad \text{i.e.} \quad \|F^k(X'_0)\| < R_0 \tag{32}$$

To enlarge the GSR, we look for a radius $r_{0,i}$ such that for any initial state $X_0$ verifying 
$\|X_0 - X'_0\| < r_{0,i}$ one has 

$$X_k = F^k(X_0) \in \Omega_0 \tag{33}$$

The fact that the state of the system reaches the domain $\Omega_0$ after $k$ iterations ensures that 
$X_0$ belongs to the stability domain. 
Let us denote: 

$$\delta X_0 = X_0 - X'_0 \tag{34}$$

and, for $k \ge 1$, 

$$\delta X_k = X_k - X'_k = F^k(X_0) - F^k(X'_0) \tag{35}$$

$\delta X_k$ can be expressed in terms of $\delta X_0$ as a polynomial function of degree $s = q^k$, where $q$ is 
the degree of the polynomial $F(\cdot)$ characterizing the system: 

$$\delta X_k = E_1\, \delta X_0 + E_2\, \delta X_0^{[2]} + \dots + E_s\, \delta X_0^{[s]}, \qquad s = q^k \tag{36}$$

$E_1, E_2, \dots, E_s$ are matrices depending on $k$ and $X'_0$; they can easily be expressed in terms 
of $A_i$ and $X'_0$. 

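In the particular case where $q = 3$ and $k = 1$ one has: 

$$\delta X_1 = X_1 - X'_1 = F(X_0) - F(X'_0) = D_1\, \delta X_0 + D_2\, \delta X_0^{[2]} + D_3\, \delta X_0^{[3]} \tag{37}$$

where 

$$D_1 = A_1 + A_2 (X'_0 \otimes I_n) + A_2 (I_n \otimes X'_0) + A_3 \big( (X'_0 \otimes I_n) \otimes X'_0 \big) + A_3 \big( {X'_0}^{[2]} \otimes I_n \big) + A_3 \big( (I_n \otimes X'_0) \otimes X'_0 \big)$$

$$D_2 = A_2 + A_3 \big( (X'_0 \otimes I_n) \otimes I_n \big) + A_3 \big( (I_n \otimes X'_0) \otimes I_n \big) + A_3 \big( I_n^{[2]} \otimes X'_0 \big)$$

$$D_3 = A_3$$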

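From the relation: 

$$X_k = \delta X_k + F^k(X'_0)$$

one has: 

$$\|X_k\| \le \|\delta X_k\| + \|F^k(X'_0)\| \tag{38}$$

From (36) we have: 

$$\|\delta X_k\| \le e_1 \|\delta X_0\| + e_2 \|\delta X_0^{[2]}\| + \dots + e_s \|\delta X_0^{[s]}\| \tag{39}$$

with: 

$$e_j = \|E_j\|, \quad j = 1, 2, \dots, s \tag{40}$$

Hence, since $\|\delta X_0^{[j]}\| = \|\delta X_0\|^j$ for the Euclidean norm, for $\|\delta X_0\| < r_{0,i}$ we have: 

$$\|X_k\| < \sum_{j=1}^{s} e_j\, r_{0,i}^{\,j} + \|F^k(X'_0)\| \tag{41}$$

Since it is desired that: 

$$\|X_k\| < R_0 \quad (X_k \in \Omega_0) \tag{42}$$

it is sufficient to have: 

$$\sum_{j=1}^{s} e_j\, r_{0,i}^{\,j} = R_0 - \|F^k(X'_0)\| \tag{43}$$

which yields: 

$$\|\delta X_0\| = \|X_0 - X'_0\| < r_{0,i} \tag{44}$$

where $r_{0,i}$ is the unique positive solution of the polynomial equation: 

$$e_1 r_{0,i} + e_2 r_{0,i}^2 + \dots + e_s r_{0,i}^s = R_0 - \|F^k(X'_0)\| > 0 \tag{45}$$

This result can be stated in the following theorem. 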

Theorem 2 

Consider the polynomial discrete system described by: 

$$X_{k+1} = F(X_k) = A_1 X_k + A_2 X_k^{[2]} + \dots + A_q X_k^{[q]} \tag{46}$$

and let $\Omega_0$ be the GSR of radius $R_0$ given in Theorem 1, and $\Gamma_0$ the boundary of the GSR. 
Then, for any point $X'_0 \in \Gamma_0$ (indexed by $i$), the ball $\Omega_{0,i}$ centred on $X'_0$ and of radius $r_{0,i}$, 
the unique positive solution of the equation (45), is also a domain of stability of the 
considered system. 
In the particular case $k = 1$, one has the following corollary. 

Corollary 1 

The ball $B_i$ centred on $X'_0$ and of radius $r_{0,i}$, solution of the equation: 

$$\|D_1\|\, r_{0,i} + \|D_2\|\, r_{0,i}^2 + \dots + \|D_q\|\, r_{0,i}^q = R_0 - \|F(X'_0)\| > 0 \tag{47}$$

is a domain of asymptotic stability of the considered system. 

After considering all the points $X'_0 \in \Gamma_0$ (varying $i$), a new domain of stability $\Omega_1$ is 
obtained by adding all the small balls $\Omega_{0,i}$ to $\Omega_0$: 

$$\Omega_1 = \Omega_0 \cup \bigcup_i \Omega_{0,i} \tag{48}$$

More generally, from the points considered on the boundary $\Gamma_j$ of a domain $\Omega_j$ and the 
associated balls $\Omega_{j,i}$, we can construct a new domain of stability $\Omega_{j+1}$ with boundary $\Gamma_{j+1}$, 
and we have $\Omega_j \subset \Omega_{j+1}$. This procedure can be repeated with the new data $\Omega_{j+1}$ and $\Gamma_{j+1}$ 
until a sufficiently large stability domain of the considered equilibrium point is obtained. 
This idea is illustrated in Figure 5. 




Fig. 5. Illustration of the principle of the proposed method 
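
For a cubic system ($q = 3$) and one iteration ($k = 1$), the enlargement radius of Corollary 1 can be computed as sketched below. This is an illustration under stated assumptions, not the authors' implementation: it assumes NumPy, the spectral norm for $\|D_j\|$ and the Euclidean norm for vectors, and hypothetical function names. The matrices in the example are made up for the demonstration and are not the Van der Pol model.

```python
import numpy as np

def poly3(A1, A2, A3, x):
    """F(x) = A1 x + A2 x^[2] + A3 x^[3]."""
    x2 = np.kron(x, x)
    return A1 @ x + A2 @ x2 + A3 @ np.kron(x2, x)

def d_matrices(A1, A2, A3, xp):
    """Matrices D1, D2, D3 of (37) at the boundary point xp."""
    n = xp.size
    I = np.eye(n)
    c = xp.reshape(n, 1)
    c2 = np.kron(c, c)
    D1 = (A1 + A2 @ (np.kron(c, I) + np.kron(I, c))
          + A3 @ (np.kron(np.kron(c, I), c) + np.kron(c2, I)
                  + np.kron(np.kron(I, c), c)))
    D2 = A2 + A3 @ (np.kron(np.kron(c, I), I) + np.kron(np.kron(I, c), I)
                    + np.kron(np.eye(n * n), c))
    return D1, D2, A3

def enlargement_radius(A1, A2, A3, xp, R0, n_iter=200):
    """Positive root of (47) by bisection; 0.0 if R0 - ||F(xp)|| <= 0."""
    margin = R0 - np.linalg.norm(poly3(A1, A2, A3, xp))
    if margin <= 0.0:
        return 0.0
    e = [np.linalg.norm(D, 2) for D in d_matrices(A1, A2, A3, xp)]
    def p(r):
        return e[0] * r + e[1] * r ** 2 + e[2] * r ** 3
    lo, hi = 0.0, 1.0
    while p(hi) < margin:          # assumes at least one e_j is positive
        hi *= 2.0
    for _ in range(n_iter):
        mid = 0.5 * (lo + hi)
        lo, hi = (mid, hi) if p(mid) < margin else (lo, mid)
    return lo

# Made-up example: contracting linear part with small nonlinear terms
A1 = 0.5 * np.eye(2)
A2 = np.zeros((2, 4)); A2[1, 0] = 0.1
A3 = np.zeros((2, 8)); A3[0, 0] = -0.2
xp = np.array([0.3, 0.0])          # point on the boundary of a ball with R0 = 0.3
r = enlargement_radius(A1, A2, A3, xp, R0=0.3)
```

Repeating the computation for several points `xp` sampled on the boundary, and taking the union of the resulting balls with the initial one, gives the enlarged domain of equation (48).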



3.4 Simulation results: application to the Van der Pol model 

Let us consider the following discrete polynomial Van der Pol model obtained from the 
Newton-Raphson approximation (Jennings & McKeown, 1992): 

$$X_{k+1} = A_1 X_k + A_3 X_k^{[3]} \tag{49}$$

where $X_k = (x_{1,k}, x_{2,k})^T$ and 

$$A_1 = \begin{pmatrix} 0.9988 & -0.0488 \\ 0.0488 & 0.950 \end{pmatrix}$$



(0 -0.0012 



V 



0.0488 



Equation (49) has an asymptotically stable linear part $A_1$, which verifies inequality (20) 
with $c = 1.7$ and $\alpha = 0.65$. We may then conclude that the origin is exponentially stable for 
each initial state $X_0$ in the disc $\Omega_0$ centred at the origin with radius $R_0 = 0.33$. 
Figure 6 shows the guaranteed stability domain $\Omega_0$ obtained by applying Theorem 1, and 
the enlarged region resulting from the application of Theorem 2 for one iteration ($k = 1$) 
and for 22 points $X'_0$ on the boundary $\Gamma_0$. The new result stated in Theorem 2 leads to a 
significant enlargement of the guaranteed stability domain. 

Fig. 6. Enlargement of a guaranteed RAS estimate of the Van der Pol discrete model. 



4. Conclusion 

An advanced discrete algebraic method has been developed to determine and enlarge the 
region of asymptotic stability of autonomous nonlinear polynomial discrete-time systems. 
The accuracy of the RAS obtained constitutes the main advantage of the proposed approach. 
The proposed technique is proved theoretically and tested by numerical simulation on the 
discrete polynomial Van der Pol model. 
The discrete method developed here is the counterpart of the trajectory reversing method 
used to determine the RAS of continuous systems. 
Further research will focus on the development and implementation of an optimal 
numerical tool for reaching the largest region of asymptotic stability of discrete nonlinear 
systems. 




5. Appendix 

Lemma 1 (Benhadj Braiek, 1996b): 

Consider a discrete nonlinear system defined by the state equation: 

$$X(k+1) = A_1 X(k) + g(k, X(k)) \tag{50}$$

where the linear part satisfies Assumption 1, and the nonlinear part $g(k, X(k))$ verifies 
the following inequality: 

$$\|g(k, X(k))\| \le \beta\, \|X(k)\| \tag{51}$$

where $\beta$ is a positive constant. 

Let $\Phi(k, k_0)$ denote the transition matrix of the linear part of the discrete system (50): 

$$\Phi(k, k_0) = A_1^{k-k_0} \tag{52}$$

and let $c$ and $\alpha$ be positive numbers with $\alpha \in\, ]0, 1[$ verifying 

$$\|\Phi(k, k_0)\| \le c\, \alpha^{k-k_0} \quad \forall k \ge k_0 \tag{53}$$

Then the solution $X(k)$ of the system (50) verifies the following inequality: 

$$\|X(k)\| \le c\,(\alpha + c\beta)^{k-k_0}\, \|X(k_0)\| \tag{54}$$

So if $\beta < \frac{1-\alpha}{c}$, the system (50) is exponentially stable. 

1. Introduction 



The use of networks as a medium to interconnect the different components of an electrical 
drive control system has been increasing over the last decades. Typically, the use of a 
network in a control system is desirable when there is a large number of distributed sensors 
and actuators. Systems designed in this manner allow easy modification of the control 
strategy by rerouting signals, allow redundant systems to be activated automatically when 
a component fails, and in general provide high-level supervisory control over the entire 
plant. The flexibility and ease of maintenance of a system that uses a network to transfer 
information are very appealing. Owing to these benefits, many industrial companies and 
institutes use networks for remote control and factory automation (Yang, 2006), (Lian et al., 
2002), (Bushnell, 2001), (Antsaklis & Baillieul, 2004), (Baillieul & Antsaklis, 2004). Control 
applications use networks to connect to the Internet in order to perform remote control over 
much greater distances than in the past, without investing in a whole infrastructure. 

The connection between network-based control systems and teaching has allowed many 
universities to develop virtual and remote control laboratories (Valera et al., 2005), (Casini et 
al., 2004), (Saad et al., 2001). For several years, at the Departments of Power Electronics and 
Electrical Drives (Baluta & Lazar, 2007) and Automatic Control and Applied Informatics 
(Carari et al., 2003), (Lazar & Carari, 2008) of the "Gh. Asachi" Technical University of Iasi, 
virtual and remote laboratories for electrical drive systems and process control have been 
developed using a Networked Control System (NCS). 

This chapter presents the experience of the electrical drives control group at the "Gh. 
Asachi" Technical University of Iasi in developing a remote control laboratory for electrical 
drive systems. A SCADA environment has been chosen to implement the network-based 
control architecture. This architecture allows the user to remotely choose a predefined 
controller to steer the electrical drive systems or to design a new one. Using SCADA 
software facilities, students can develop new networked control systems themselves for the 
setups in the laboratory. The main advantage of the network-based control structure is the 
user interface, which allows the user to analyze the performance of the electrical drive 
system, to tune the controller and to test it through the remote laboratory. During the 
experiments, it is possible to change the set point, the operating mode and some typical 
controller parameters. Experimental results can be displayed, showing the real running 
experiment, and can be checked through on-line plots. 
The chapter is organized as follows. Section 2 illustrates the main features and the 
architecture of the networked control system laboratory. In Section 3, typical working 
sessions are described. Section 4 provides conclusions and future developments. 

2. Remote control architecture 

To achieve remote control of the electrical drive systems, a Web server is used, which 
distributes the process to different users (clients) via Intranet and Internet. The Intranet is 
the local computer network of the Department of Power Electronics and Electrical Drives of 
the "Gh. Asachi" Technical University of Iasi. The electrical drive systems are distributed 
using an NCS architecture, which implements both configurations: direct structure and 
hierarchical structure (Tipsuwan, 2003). 



2.1 NCS architecture 

The developed NCS architecture has the layout shown in Fig. 1. 

Fig. 1. Remote control architecture. 

A user can access the process and run an experiment in real time using Intranet and 
Internet. The user can design and implement different control structures for electrical drive 
systems using SCADA software facilities or, for a given control structure, implement and 
test PID control algorithms and tuning procedures. SCADA software enables programmers 
to create distributed control applications with supervisory facilities and a Human-Machine 
Interface (HMI). As SCADA software, Lookout is used for the direct structure and LabVIEW 
for the hierarchical structure. All external signals start from and arrive at the HMI/SCADA 
computer. 
The laboratory architecture allows running experiments while interacting with instruments 
and remote devices. The remote I/O devices permit data acquisition from sensors and 
supply control signals to actuators, using A/D and D/A converters. 
The NCS architecture offers the possibility to remotely choose a predefined control structure 
to handle the electrical drive system variables or to design a new control application, using 
SCADA software facilities. 
In the first case, using the remote control architecture, students can practise their theoretical 
knowledge of electrical drive systems control easily, thanks to process access through a 
friendly user interface. The second opportunity offered to students is to design a new 
networked control architecture, creating a new HMI/SCADA application to remotely 
control a process using Lookout or LabVIEW facilities (Carari et al., 2003). 
The software architecture can be split into two parts: the server side, which concerns the 
control of the physical process, and the client side, which relates to the user interface. The 
server runs on the Microsoft Windows NT platform and is based on Lookout for the direct 
structure and on the LabVIEW environment for the hierarchical structure. 

The server application contains the HMI interface and fulfils the following functions: 

• implements the control strategies; 

• communicates with I/O devices through object drives; 

• records the signals in a database; 

• defines the alarms. 

The client process contains an HMI interface, which may or may not be similar to that of the 
server application, and has the following characteristics: 

• allows the parameters defined by the server application to be modified remotely through 
a Web site; 

• communicates with the server application; 

• displays the alarms defined by the server application. 

The remote control architecture is mainly intended for educational use and is employed in 
the electrical drive control course. The aim is to allow students to put their knowledge of 
electrical drives and control theory into practice easily, through laboratory and project 
work, without restrictions due to process availability. One of the main features is the 
possibility of integrating a user-designed controller in the control loop of the remote 
process. The interface for the controller synthesis is very friendly. 

2.2 NCS in the direct structure 

The NCS in the direct structure is composed of a computer on the Intranet, called 
HMI/Lookout, which handles the local communications with the process using Ethernet 
protocols. The remote electrical drive system, a D.C. brush servomotor, is connected to the 
communication module (CM), which transfers data between the I/O devices and the 
HMI/SCADA computer via a communication system. The communication module and I/O 
devices are implemented with National Instruments modules: FP1600 (Ethernet) for 
communication and Dual-Channel Modules for the I/O devices. The Lookout environment 
has been chosen to implement the HMI/SCADA application. For the D.C. servomotor, a 
cascade control structure is used in order to control the speed in the primary loop and the 
current in the secondary loop. The current controller is implemented locally and the speed 
controller is implemented remotely using the Lookout environment. The cascade control 
structure allows the monitoring of the control loop variables and the command of the 
overload at the servomotor shaft. 

2.3 NCS in the hierarchical structure 

The hierarchical structure is composed of a computer on the Intranet, called HMI/LabVIEW, 
with a PCI motion controller board (National Instruments PCI-7354) and analog and digital 
I/O devices. 

DSP controllers available today are able to perform the computation for high-performance 
digital motion control structures for different motor technologies and motion control 
configurations. The level of integration is continuously increasing, and the clear trend is 
towards completely integrated intelligent motion control (Kreidler, 2002). Highly flexible 
solutions, easily parameterized and "ready-to-run", are needed in the current 
"time-to-market" pressure environment, and must be available at non-specialist level. 
Basically, the digital system component implements, through specific hardware interfaces 
and corresponding software modules, the complete or partial hierarchical motion control 
structure, i.e., the digital motor control functionality at the low level and the digital motion 
control functionality at the higher level (see Fig. 2). 



[Figure: hierarchy diagram with the labels communication protocols, reference generator, 
motion control, speed control, position control, motor control, current control, PWM pulse 
control, digital system, real-time operating kernel.] 

Fig. 2. Motion system structure hierarchy. 

The National Instruments PCI-7354 controller is a high-performance 4-axis motion 
controller for stepper, D.C. brush and D.C. brushless servomotors. This controller can be 
used for a wide variety of both simple and complex motion applications. It also includes a 
built-in data acquisition system with eight 16-bit analog inputs as well as a host of 
advanced motion trajectory and triggering features. Through its four individually 
programmable axes, the board can control motion independently or in a coordinated mode. 
The board architecture, which is built around a dual-processor core, has its own real-time 
operating system. These board resources provide the high computational power needed for 
such real-time control. 
Three electrical drive systems, based on unipolar or bipolar stepper servomotors, D.C. 
brush servomotors and D.C. brushless servomotors, are linked to the remote control 
architecture. The connection is achieved with the I/O devices of the PCI motion controller 
board, which also contains a remote controller implemented using a DSP and a real-time 
operating system, as presented in Fig. 3. 

Fig. 3. Motion controller board structure. 

Functionally, the architecture of the National Instruments PCI-7354 controller is generally 
divided into four components (see Fig. 4): 

• supervisory control; 

• trajectory generator; 

• control loop; 

• motion I/O. 

Fig. 4. Functional architecture of the NI PCI-7354. 

The supervisory control performs all the command sequencing and coordination required to 
carry out the specified operation. The trajectory generator provides path planning based on 
the profile specified by the user, and the control loop block performs fast closed-loop control 
with simultaneous position, velocity and trajectory maintenance on one or more axes, based on 
feedback signals. 

The LabVIEW environment was chosen to implement the HMI/SCADA application. The 
development environment used is LabVIEW 7.0, which, besides a graphical programming 
approach that is easy to use and understand, takes full advantage of the networking 
resources. Using the NCS hierarchical structure, a supervisory control architecture for 
stepper servomotors, D.C. brush servomotors and D.C. brushless servomotors was developed, 
which provides the following functions: 

• open-loop control of stepper servomotors using constant frequency or a prescribed profile 
(trapezoidal or S-curve move profile, see Fig. 5); 




a. Trapezoidal. b. S-Curve. 

Fig. 5. Move Profile. 

• closed-loop control of stepper servomotors (speed and position); 

• the operating mode of the stepper servomotors can be with full step, half step or 
microstepping mode; 

• open-loop control or closed-loop control of D.C. brush servomotors; 

• self-commutation control of D.C. brushless servomotor speed, which allows very high 
speeds; 

• sensorless control for D.C. brushless servomotor. 
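
The trapezoidal move profile mentioned in the list above can be illustrated with a short sketch. The Python code below is only an illustration and not the trajectory generator of the NI PCI-7354: the function names `trapezoidal_profile` and `velocity_at` are hypothetical, the profile is assumed to be symmetric (equal acceleration and deceleration), and the units are arbitrary (for example steps, steps/s and steps/s²).

```python
import math


def trapezoidal_profile(distance, v_max, a_max):
    """Return (t_acc, t_cruise, v_peak) for a symmetric trapezoidal move."""
    t_acc = v_max / a_max                    # time needed to reach v_max
    d_acc = 0.5 * a_max * t_acc ** 2         # distance covered while accelerating
    if 2 * d_acc >= distance:
        # Short move: v_max is never reached, the profile becomes triangular.
        t_acc = math.sqrt(distance / a_max)
        return t_acc, 0.0, a_max * t_acc
    t_cruise = (distance - 2 * d_acc) / v_max
    return t_acc, t_cruise, v_max


def velocity_at(t, distance, v_max, a_max):
    """Commanded velocity at time t (t = 0 is the start of the move)."""
    t_acc, t_cruise, v_peak = trapezoidal_profile(distance, v_max, a_max)
    if t < 0:
        return 0.0
    if t < t_acc:
        return a_max * t                     # acceleration ramp
    if t < t_acc + t_cruise:
        return v_peak                        # constant-velocity segment
    t_dec = t - t_acc - t_cruise
    return max(0.0, v_peak - a_max * t_dec)  # deceleration ramp, then rest
```

An S-curve profile would additionally limit the jerk, so that the corners of the velocity trapezoid are rounded; that case is not covered by this sketch.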

3. Session description 

This section presents three examples that illustrate the manipulation of instruments and real 
devices for low-power electrical drives education on the Web. The examples correspond to 
laboratory work dedicated to the control of a D.C. brush servomotor electrical drives system 
and of a stepper servomotor electrical drives system. 



3.1 NCS in the direct structure: D.C. brush servomotor electrical drives system 

The set-up for D.C. brush servomotor control is presented in Fig. 6 and its layout is shown 
in Fig. 1. 

The set-up consists of a D.C. brush servomotor, a D.C. brush generator used as load, an 
incremental encoder and a tachometer as speed transducers, a controlled loading device and 
a control block that includes an analog controller for the current loop. The set-up is used 
to illustrate cascade control and how this structure rejects load disturbances. The 
primary variable is the motor speed, the secondary variable is the armature current, and the 
disturbance is an overload introduced by the D.C. brush generator coupled to the 
servomotor shaft. The user has to tune the two controllers of the speed and current loops in 
order to obtain optimal performance and disturbance rejection. 

When the application is started, the main panel in Fig. 7 is displayed. On its right side, the 
main panel shows the layout of the control system and real-time data from the sensors 
and actuator. It also allows the user to set the reference, command the overload, choose the 
operating mode (manual or automatic) and change the tuning parameters of the PI 
speed controller (Baluta & Lazar, 2007). 






[Fig. 6 labels: tachometer, D.C. servomotor, D.C. generator, incremental encoder, absolute encoder, control block, controlled loading device.]

Fig. 6. D.C. brush servomotor electrical drives system. 




Fig. 7. Main panel for D.C. brush servomotor electrical drives system. 

All these commands are issued from the lower-left side of the main panel. The upper-left 
side contains a monitor window that displays the main variables of the control loops. In the 
monitor window, on/off buttons select which signals are shown, the display can be frozen in 
order to analyse the signals, and the time scale can be changed with a potentiometer button. 
All real-time data of the control system are recorded in a database and can be used for later 
analysis (Baluta & Lazar, 2007). 

The remote control of the D.C. brush servomotors implements the following functions: 

• the reversible speed control; 

• the on-line tuning of the parameters of speed controller; 

• load command for servomotor shaft. 

The monitor window of the main panel in Fig. 7 shows the speed response to different step 
variations of the set point, the load rejection and the evolution of the current. Details 
regarding disturbance rejection are given in Fig. 8. The parameters of the numerical 
controller that led to these results are Kr=0.2, Ti=2 and Td=0. With this tuning, the response 
of the system to a step variation of the set point is very good, and the disturbance (the load 
torque) is well compensated. 
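
To make the role of the three tuning parameters concrete, the following Python sketch shows one common discrete form of such a controller. It is an assumption, not the controller implemented in the Lookout application: the class name `DiscretePID` is hypothetical, the ideal form u = Kr·(e + (1/Ti)·∫e dt + Td·de/dt) is assumed, and Ti and Td are assumed to be expressed in the same time unit as the sampling period `dt`.

```python
class DiscretePID:
    """Ideal-form PID: u = kr * (e + integral(e)/ti + td * de/dt)."""

    def __init__(self, kr, ti, td, dt):
        self.kr, self.ti, self.td, self.dt = kr, ti, td, dt
        self.integral = 0.0      # running integral of the error
        self.prev_error = 0.0    # error at the previous sample

    def update(self, setpoint, measurement):
        error = setpoint - measurement
        self.integral += error * self.dt                  # rectangular integration
        derivative = (error - self.prev_error) / self.dt  # backward difference
        self.prev_error = error
        return self.kr * (error + self.integral / self.ti + self.td * derivative)


# With Td = 0 the controller reduces to the PI speed controller of the text.
speed_controller = DiscretePID(kr=0.2, ti=2.0, td=0.0, dt=0.1)
```

With a constant unit error, the proportional part contributes Kr and the integral part adds another Kr after a time equal to Ti, which is the usual interpretation of the integral time.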




a. Step variation of the set point. 




b. Load disturbance rejection. 
Fig. 8. Monitoring panel (set point-black, speed-brown, armature current-blue). 



3.2 NCS in the hierarchical structure: D.C. brush servomotor electrical drives system 

The D.C. brush servomotor has the following features: 

• rated armature voltage $U_n$ = 20 V; 

• rated armature current $I_n$ = 2 A; 

• rated speed $n_n$ = 2400 rpm; 

• armature winding resistance $R_a$ = 1.31 Ω; 

• armature winding inductance $L_a$ = 7.58 mH. 

The D.C. brush servomotor is driven by a reversible chopper with the L292 specialized 
integrated circuit (Baluta, 2004). The motor's shaft is connected with an incremental optical 
encoder that gives 1000 pulses/rev. (this means 4000 pulses/rev. at the output of the NI 
PCI-7354 controller). The D.C. brush servomotor is loaded using a controlled loading device. 
Among the features of the loading system, the authors emphasize (Baluta, 2004): 

• the possibility to load the electric drive motor with a reactive load torque in a wide 
range of speed, including very low speeds; 

• the possibility to impose a constant load torque operation mode or an overload 
operation mode. 
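
As a quick cross-check of the motor and encoder data given above, the armature electrical time constant and the position resolution can be worked out. This is an illustrative calculation only: the symbol $\tau_a$ is introduced here, and the factor of four between 1000 and 4000 pulses/rev. is assumed to come from counting all four edges of the two encoder channels, which the text does not state explicitly.

$$\tau_a = \frac{L_a}{R_a} = \frac{7.58\ \text{mH}}{1.31\ \Omega} \approx 5.79\ \text{ms}$$

$$\frac{360^\circ}{4000} = 0.09^\circ \text{ per count}, \qquad \frac{2400\ \text{rev/min}}{60\ \text{s/min}} \cdot 4000\ \frac{\text{counts}}{\text{rev}} = 160\,000\ \text{counts/s at rated speed}$$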

The implemented control system uses the algorithm in Fig. 9, which generates the move 
trajectory in real time and allows the motion parameters to be changed. To program the 
motion control system, the developer uses the LabVIEW environment with specific virtual 
instruments for motion control from the FlexMotion library. 

Fig. 9. Position-based straight-line move algorithm. 

Within this application, the user can study either: 

• the command regimes in speed open-loop; 

• the position control regimes of the positioning system. 

The graphical program associated with the first type of study is shown in Fig. 10. It is 
structured as a single while loop, preceded by an initialization sequence that configures the 
system (selection of the chopper control signal, setting of the optical encoder parameters, 
definition of the speed measurement units). The control elements of the loop allow the user 
to modify the chopper control signal and/or to set the servomotor loading degree. The 
quantities acquired for graphical representation are the electrical ones (current in the 
armature winding and modulated supply voltage of the armature winding) and the mechanical 
ones (position and speed of the drive system). The application is stopped explicitly with the 
Stop push-button available in the graphical user interface. 

Fig. 10. LabVIEW diagram for the command regimes in speed open-loop. 

The vast majority of motion control algorithms employed in industrial applications take one 
of two forms (Ellis & Lorentz, 1999): 

• the well-known PID position loop (Fig. 11a); 

• an average velocity loop cascaded with a position loop (Fig. 11b). 

In addition, feedforward control is often employed to achieve near-zero following (tracking) 
error. Feedforward control requires that both the velocity command, Ω*(t), and the 
acceleration command, ε*(t), be available and synchronized with the position command, θ*(t). 

An example of how feedforward control is added to the second servocontrol 
topology is shown in Fig. 12. The National Instruments PCI-7354 controller can be 
configured for any of the above servocontrol topologies. 



[Fig. 12 diagram labels: acceleration, velocity and position commands from the trajectory generator; feedforward control; velocity command; PIV control with gain K_v; velocity estimator fed from the encoder; output to the drive.]

Fig. 12. Basic feedforward and PIV control topology. 
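
The benefit of feedforward can be illustrated with a small simulation. The Python sketch below is a simplified stand-in for the topology of Fig. 12, not the algorithm of the NI PCI-7354: it assumes a proportional position loop cascaded with a PI velocity loop, a pure-inertia load, perfect knowledge of the inertia for the acceleration feedforward, and hypothetical gain values. The function name `simulate_tracking` is also an assumption.

```python
def simulate_tracking(use_feedforward, steps=2000, dt=0.001):
    """Return the largest position error while following a constant-acceleration move."""
    inertia = 0.01                 # hypothetical load inertia
    kp, kv, ki = 20.0, 0.5, 5.0    # hypothetical position and velocity loop gains
    accel_ref = 10.0               # commanded acceleration
    theta = omega = integral = 0.0
    max_error = 0.0
    for k in range(steps):
        t = k * dt
        vel_ref = accel_ref * t               # velocity command
        pos_ref = 0.5 * accel_ref * t * t     # position command
        pos_error = pos_ref - theta
        max_error = max(max_error, abs(pos_error))
        # Position loop (P): position error -> velocity command
        vel_cmd = kp * pos_error
        if use_feedforward:
            vel_cmd += vel_ref                # velocity feedforward
        # Velocity loop (PI): velocity error -> torque command
        vel_error = vel_cmd - omega
        integral += vel_error * dt
        torque = kv * vel_error + ki * integral
        if use_feedforward:
            torque += inertia * accel_ref     # acceleration feedforward
        # Plant: pure inertia, integrated with a simple Euler step
        omega += torque / inertia * dt
        theta += omega * dt
    return max_error


error_without = simulate_tracking(use_feedforward=False)
error_with = simulate_tracking(use_feedforward=True)
```

In this idealized setting the following error without feedforward grows with the commanded velocity, whereas with velocity and acceleration feedforward it stays close to zero; on a real drive the improvement depends on how well the feedforward gains match the load.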

To build a positioning application, the developer follows the stages shown in 
Fig. 13, which are the steps required to design a motion application. The learner can 
also perform the second to fifth stages of motion application design (Bauer & Fedok, 2003), 
(Fedok & Bauer, 2005), (Saliah et al., 1999). 

Fig. 13. Generic steps for designing a motion application. 

The D.C. brush servomotor set-up introduces students to motor control and lets them learn 
about this motor without attending a laboratory session at their institution. Since the 
majority of students have no prior technical experience, this economical remote set-up offers 
them a useful tool for learning about servomechanisms and data acquisition. 

To ensure proper performance of the servomechanism, the motion control system must be 
tuned and tested. Then the type of move profile is planned. Motion constraints are the 
maximum velocity, acceleration, deceleration and jerk that the system can handle. Trajectory 
parameters are expressed as a function of motor shaft revolutions. The trajectory generator 
takes into account the type and the constraints of the motion and generates the values of the 
instantaneous trajectory parameters in real time. 

Basically, the student learns about the parameters of the D.C. servomechanism by tuning 
the control loop parameters and setting the motion constraints so that the drive executes the 
required operations, i.e., changes of rotation direction, position and speed. During the 
experiment, the student can also visualise critical input and output points in the D.C. brush 
servomotor electronic interface using virtual oscilloscopes. Since the system runs in real 
time, the learner observes changes in position, direction and velocity as the motor rotates 
after being commanded. Fig. 14 shows the graphical user interface specific to this 
application. 

Fig. 14. Main panel for D.C. brush servomotor electrical drives system (open-loop operating 
mode). 

Fig. 14 shows the experimental results obtained for open-loop control of the electrical 
drives system when the user applies a load torque disturbance to the shaft of the controlled 
servomotor. The first virtual oscilloscope presents the variations of the armature current and 
of the modulated voltage applied to the armature terminals (for a command signal of 6.5 V). 
The speed drop caused by the load torque disturbance can be seen on the second virtual 
oscilloscope. The user can thus deduce the causal link between the disturbance acting on 
the system and its effect on the output. 

A more complex study can be carried out with the second application designed for this 
stand: the control of a positioning system driven by a D.C. brush servomotor (using the 
motion control algorithms from Fig. 11 and Fig. 12). The LabVIEW program developed for this 
purpose is shown in Fig. 15. As in the application for position control of the 
positioning system driven by a stepper servomotor, the program is structured into two while 
loops (the control algorithm follows the processing flow described by the flowchart 
in Fig. 9). The novel element, compared with the stepper positioning structure, is the 
position and speed controller, which offers multiple configuration possibilities 
for the control topology. The user can choose either a classical position control structure 
with a PID controller, or a master-slave (cascade) structure of the P-position and PI-speed 
type. Either structure can also be completed with feedforward control elements driven by 
the speed and/or acceleration references of the trajectory generator. Once the topology has 
been established, the user has to tune the control structure to obtain the desired 
performance. 




Fig. 15. LabVIEW diagram for single-axis positioning system (closed-loop operating mode). 

The graphical user interface (see Fig. 16) shows the experimental results obtained for a 
classical position control structure using a PID controller. The tuning parameters of the 
control law are presented in the bottom-left area reserved for the controller settings, and 
those related to the motion in the top-left part. The results confirm that positioning within 
the range [0 → 1000] rev. is completed in full accordance with the requirements imposed by 
the motion constraints and with good dynamic behavior, that is, short and well-damped 
transients. 

Fig. 16. Main panel for D.C. brush servomotor electrical drives system (closed-loop 
operating mode). 

Through the graphical user interfaces, the students acquire the following abilities: 

• enabling the motor and resetting the control system; 

• calibrating a D.C. brush servomechanism; 

• controlling the rotation direction of the D.C. brush servomotor; 

• moving the servomotor shaft from one arbitrary position to another; 

• accelerating the servomotor and maintaining a constant velocity as well as decelerating 
and bringing it to a complete stop; 

• visualizing intermediate motor control and encoder signals; 

• understanding the relationship between the external loads and command effort. 



3.3 NCS in the hierarchical structure: Stepper servomotor electrical drives system 

To study positioning applications for stepper servomotor electrical drives, the set-up 
shown in Fig. 17, which has the layout from Fig. 1, is used. 

Fig. 17. Stepper motor electrical drives system. 

The set-up consists of a stepper servomotor, an optical encoder, a controlled loading device 
and electronic boards for the delivery of control sequences. 

The 2-phase stepper servomotor (made in Italy, type HY 200-2222-100 D 6) has the following 
features: 

• step angle 1.8 degrees/step (200 steps/revolution); 

• rated supply voltage $U_n$ = 5 V; 

• current per phase $I_n$ = 1 A; 

• resistance per phase $R_f$ = 5 Ω; 

• inductance per phase $L_m$ = 5.45 mH; 

• electrical time constant $T_e$ = 1.09 ms; 

• maximum synchronization torque $M_{max}$ = 0.3 N·m. 

The stepper servomotor is driven by two specialized integrated circuits (made by 
SGS-THOMSON Microelectronics): L297 (stepper servomotor controller) and L298N 
(dual H-bridge driver). The servomotor's shaft is connected to an incremental optical 
encoder that gives 250 pulses/rev. (this means 1000 pulses/rev. at the output of the sense 
discriminator or of the NI PCI-7354 controller). The features of the loading system make it 
possible (Baluta et al, 1997), (Baluta, 2003): 

• to load the electric drive servomotor with a reactive load torque in a wide range of 
speed, including very low speeds; 

• to impose a constant load torque operation mode or an overload operation mode; 

• to prescribe, analogically or numerically, the amplitude and time length of the load 
torque. 
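
The stepper data listed above are mutually consistent, as the following short check shows. It is an illustrative calculation only; the counts-per-step figures assume that the 1000 pulses/rev. quoted for the encoder output are all used for position feedback.

$$\frac{360^\circ}{1.8^\circ/\text{step}} = 200\ \text{steps/rev}, \qquad T_e = \frac{L_m}{R_f} = \frac{5.45\ \text{mH}}{5\ \Omega} = 1.09\ \text{ms}, \qquad \frac{U_n}{R_f} = \frac{5\ \text{V}}{5\ \Omega} = 1\ \text{A} = I_n$$

$$\frac{1000\ \text{counts/rev}}{200\ \text{steps/rev}} = 5\ \text{counts per full step}, \qquad \frac{1000}{400} = 2.5\ \text{counts per half step}$$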

Axes driven by stepper servomotors can be controlled in open loop, using a constant 
frequency or a prescribed profile (trapezoidal or S-curve move profile), or in closed loop. In 
the latter case, the user has to initialise the feedback for the position and speed variables and 
to set the pulse-generation control block for the stepper servomotor command appropriately. 
The stepper servomotor can be operated in full-step, half-step or microstepping mode. The 
main function of a motion controller is to ensure the desired motion trajectories. 

The implemented control system uses the algorithm in Fig. 9, which generates the move 
trajectory in real time and allows the motion parameters to be changed. To program the 
motion control system, the user employs the LabVIEW environment with specific virtual 
instruments for motion control from the FlexMotion library. The associated graphical 
program is shown in Fig. 18 (Baluta, 2007). 

The processing flow is structured into two while loops. The inner loop, executed with a 
period of 1 ms, updates the time trajectories of the quantities of interest: electrical (phase 
currents and phase supply voltages) and mechanical (position and speed). The inner loop is 
left when the prescribed position is reached or when the follow-up speed exceeds a 
prescribed level. The outer control loop is executed with a period of 1 s. On the one hand, it 
configures the application according to the parameters of the drive system (number of 
stepper steps, number of optical encoder pulses); on the other hand, it defines the operating 
regimes and modes and the motion parameters (full step/half step, open-loop/closed-loop, 
final position, maximum speed, maximum acceleration and maximum deceleration). 




Fig. 18. LabVIEW diagram for single-axis positioning system. 

The application calculates the error between the prescribed position and the real position 
and, when this error is non-zero, the inner loop is executed until the position error 
becomes zero. As one can notice, the program provides two types of configuration: 

• implicit configurations, not accessible to user (number of stepper steps, number of 
pulses/ rev. of optical encoder); 

• explicit configurations available in the user graphical interface (full steps/ half step, 
open-loop/ closed -loop, servomotor loading, position initialisation). 

These configurations are applied when the main loop is executed, that is, after a 
programmed motion profile has been completed. Nevertheless, in order to let the user apply 
an overload while the motion profile is being executed, the overload command was included 
in the inner control loop. 

When the control mode of the stepper servomotor (full step/half step) is modified, the 
setting for the number of steps that the servomotor executes during a full revolution must be 
updated as well. This is done automatically, in a manner transparent to the user. 

The application can be stopped explicitly with the Stop push-button, which exits the main 
loop of the program. 
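
The two-loop structure described above can be summarized in a few lines of Python. This is a schematic sketch of the control flow only, not a translation of the LabVIEW diagram: the function names `inner_loop` and `outer_loop` are hypothetical, the motor is idealized as moving exactly one step per inner cycle, and the list of targets stands in for values that the user would enter in the graphical interface.

```python
def inner_loop(position, target, speed, speed_limit):
    """Run until the position error is zero or the speed exceeds the prescribed level."""
    trace = [position]
    while position != target and abs(speed) <= speed_limit:
        position += 1 if target > position else -1   # one step per (nominally 1 ms) cycle
        trace.append(position)                        # update the displayed trajectory
    return position, trace


def outer_loop(targets, speed=10.0, speed_limit=100.0):
    """Each (nominally 1 s) outer cycle reads a new target, then runs the inner loop."""
    position, traces = 0, []
    for target in targets:
        if target != position:        # a non-zero position error starts the inner loop
            position, trace = inner_loop(position, target, speed, speed_limit)
            traces.append(trace)
    return position, traces


final_position, traces = outer_loop([90, -90])
```

In the real application the loop periods are enforced by timing functions and the overload command is read inside the inner loop; both are omitted here for brevity.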

To build a positioning application, the developer follows the stages shown in 
Fig. 13, which are the steps required to design a motion application. 

The trajectory generator takes into account the type and the constraints of the motion and 
generates points, or instantaneous positions, in real time. The control loop converts each 
instantaneous position into a voltage or into a step-and-direction signal, depending on the 
type of motor used. Motion constraints are the maximum velocity, acceleration, deceleration 
and jerk that the system can handle. The trajectory generator creates a velocity profile based 
on the values of these motion constraints. Trajectory parameters are expressed as a number 
of steps. In this case the number of steps depends on the type of stepper drive and on the 
servomotor used. 

Basically, the student learns about the parameters of the stepper servomotor by changing 
modes and input settings so that the drive executes the required operations, i.e., changes of 
rotation direction, position and speed. During the experiment, the student can also visualise 
critical input and output points in the stepper servomotor electronic interface using virtual 
oscilloscopes. Since the system runs in real time, the learner observes changes in position, 
direction and velocity as the motor is stepped at various frequencies, corresponding to 
increasing or decreasing rates in steps/s. 

The graphical user interfaces for a positioning application of stepper servomotors are 
presented in: 

• Fig. 19 for open-loop control and full step operating mode; 

• Fig. 20 for closed-loop control and half step operating mode. 

Fig. 19. Main panel for stepper servomotor electrical drives system (open-loop control and 
full step operating mode). 

The graphical interface includes three virtual oscilloscopes that display the phase 
currents through the servomotor windings, the voltages applied to the windings and the 
trajectories of the mechanical quantities (prescribed position, measured position, prescribed 
speed). In addition, the real position is presented dynamically by means of a graphical 
slider. The instantaneous numerical values of the above quantities are also shown using 
virtual display elements. 

Fig. 20. Main panel for stepper servomotor electrical drives system (closed-loop control and 
half step operating mode). 

The graphical interface offers to the user certain switches, in order to establish the desired 
operating mode: 

• the initialization of the position register of the positioning system is achieved by means 
of the Reset TIRO push-button; 

• the Powered/Unpowered push-button provides a logic command that enables the 
stepper servomotor driver; 

• the Full Step/Half Step push-button defines the stepper servomotor command mode (full 
step or half step); 

• the positioning system loading is achieved by means of a controlled loading device 
being controlled through the Load/Unload and Overload push-buttons. 

The motion parameters and constraints (speed, acceleration, position) are entered through 
numerical input elements. Once these values are programmed, the supervision system 
transfers them to the trajectory generator, which in turn supplies suitable references to the 
control loops. The user can thus program the motion parameters dynamically; the new values 
are taken into account immediately after the previously programmed motion cycle is 
completed. Fig. 19 shows the experimental results obtained by open-loop control of the 
position of the positioning system between the extremes [-90 → +90] steps, with a maximum 
speed of 10 steps/s and an acceleration/deceleration of 10 steps/s². The servomotor is 
commanded in full-step mode and the positioning system is unloaded. 
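
For orientation, the duration of one traverse between the two extremes can be estimated from these motion constraints. The calculation below is a sketch that assumes a symmetric trapezoidal profile; the symbols $t_a$, $d_a$, $t_c$ and $t_{total}$ are introduced here for illustration only.

$$t_a = \frac{v_{max}}{a} = \frac{10\ \text{steps/s}}{10\ \text{steps/s}^2} = 1\ \text{s}, \qquad d_a = \frac{v_{max}^2}{2a} = 5\ \text{steps}$$

$$t_c = \frac{180 - 2 \cdot 5}{10}\ \text{s} = 17\ \text{s}, \qquad t_{total} = 2t_a + t_c = 19\ \text{s}$$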

The time trajectories of the electrical and mechanical quantities, obtained for three 
positioning cycles, are shown on the virtual oscilloscopes of the graphical user interface. The 
user can thus correlate the command and control quantities with the 
operating regimes and loading modes of the positioning system. Fig. 20 shows, for instance, 
the operation of the positioning system in closed loop. A comparison shows that in this case 
the feedback signals are slightly contaminated by noise. Nevertheless, the control block 
ensures the desired motion parameters and guarantees that positioning is executed without 
the risk of losing steps. 
Through the user interface, the students acquire the following abilities: 

• calibrating a stepper servomotor system; 

• controlling the rotation direction of the stepper servomotor; 

• enabling the motor and resetting the control system; 

• understanding the relationship between a step and the number of steps required 
to make one complete 360-degree revolution; 

• moving the motor shaft from one arbitrary position to another; 

• accelerating the motor and maintaining a constant velocity as well as decelerating and 
bringing it to a complete stop; 

• visualizing intermediate motor control and encoder signals. 

The implemented control structure for stepper motors permits the user to get familiarized 
with solving problems for positioning applications. 

4. Conclusions 

In this chapter, a remote control laboratory for electrical drive systems is presented. The 
laboratory allows students to develop network-based control systems using an 
architecture based on I/O devices, communication modules and server-client applications 
implemented with the facilities of the Lookout and LabVIEW environments, and to operate 
real electrical drive systems through the Intranet and the Internet. 

Three examples were presented that demonstrate the potential of the networked control 
system laboratory to control electrical drive systems remotely and to develop new 
network-based control system structures. 

Work is in progress to extend the laboratory with a new electrical drive system: a D.C. 
brushless servomotor electrical drives system. 

1. Introduction 

The problem of control performance assessment and monitoring is becoming more and more 
significant because control systems have an ever greater influence on the accomplishment of 
the aims set by companies. These aims are connected with quality, safety and profit. This 
justifies the academic and commercial interest in developing methods for analyzing the 
quality of such systems that avoid the unreliable human factor. The performance of a 
control system cannot be described by a single simple statistic. The whole procedure, called 
control loop benchmarking (Harris & Seppala 2002), requires a much more complicated 
multi-stage process consisting of: data acquisition; analysis and diagnostics (building the 
tool based on a mathematical model of the process, estimating the lower bound, assessing 
the performance of the existing control loop, testing the performance improvement 
achievable with the existing controller structure); and retuning or replacement of the 
control algorithm. 

2. Background of the problem 

Complex systems usually comprise numerous loops controlled by local 
SISO controllers. Most of these industrial control loops are equipped with PID-type 
controllers whose parameters are usually tuned using a classical approach that neglects the 
disturbance characteristics. The decision to retune or replace any of these controllers should 
be preceded by an investigation of whether, and to what extent, this would improve 
performance. Such a procedure is referred to as benchmarking or control performance 
assessment (Desborough & Harris 1992). Most of the research done so far assumes MV 
(minimum variance) control as the performance lower bound and the variance of the system 
output as the basic quantity for control quality assessment. The classical (Huang 2003) 
performance measure is as follows 

$$\eta = \frac{\sigma_{mv}^2}{\sigma_y^2} \tag{1}$$

where the hypothetical minimum variance $\sigma_{mv}^2$ is determined analytically by 

$$\sigma_{mv}^2 = \left(f_0^2 + f_1^2 + f_2^2 + \dots + f_{l-1}^2\right)\sigma_a^2 \tag{2}$$

and is defined by the impulse response coefficients $f_0, f_1, \dots, f_{l-1}$, the variance 
$\sigma_a^2$ of the driving noise of the disturbance model, and the discrete-time delay $l$. It is 
very important that $\eta$ can be estimated directly from loop operating data. The exact value 
of the delay $l$ is assumed to be known to the process engineer. Unfortunately, very often 
there is no pure delay in the process, and the value of the delay $\tau$ of the frequently used 
lag-delay model 

$$H(s) = \frac{k}{Ts+1}\,e^{-s\tau} \tag{3}$$

is then used to determine $l$ for a given sampling period $h$. As shown in (Blachuta & Bialic 
2005), in this case the value of the index $\eta$ may not be very bad even in a relatively poorly 
tuned control system, hiding the fact that the best achievable accuracy, which corresponds 
to $l=1$, can be much better. Moreover, the MV-based benchmark does not take the control 
effort into account, and it is often useless because of the large magnitudes of the control 
signal. 
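
The classical index can be evaluated in a few lines once the impulse response of the disturbance model is available. The Python sketch below follows equations (1) and (2) directly; the function names and the numerical values in the example are illustrative assumptions, and in practice the coefficients and variances would be estimated from loop operating data.

```python
def minimum_variance(impulse_response, delay, noise_variance):
    """Equation (2): sum of the first `delay` squared coefficients times the noise variance."""
    return sum(f * f for f in impulse_response[:delay]) * noise_variance


def performance_index(impulse_response, delay, noise_variance, output_variance):
    """Equation (1): ratio of the minimum achievable variance to the actual output variance."""
    return minimum_variance(impulse_response, delay, noise_variance) / output_variance


# Hypothetical data: f_0 = 1, f_1 = 0.5, f_2 = 0.25, delay l = 2, unit driving noise.
eta = performance_index([1.0, 0.5, 0.25], delay=2, noise_variance=1.0, output_variance=2.5)
```

A value of the index close to 1 means that the loop already operates near the minimum variance bound, while a value close to 0 indicates a large potential for improvement. Note how strongly the result depends on the assumed delay, which is the weakness discussed above.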

In this respect, a modified MV control strategy with bounded control variance is used as 
the benchmark in this chapter. This results in the LQG control algorithm, which allows 
control performance to be assessed under the assumption of the same control effort. 

3. Control problem statement 

It is assumed that the linear SISO plant is modeled by the following stochastic 
continuous-time system 

$$\frac{dx(t)}{dt} = Ax(t) + bu(t-\tau) + c\,\dot{\xi}(t) \tag{4}$$

$$y(t) = d'x(t) \tag{5}$$

where $x(t)$ is the $p$-dimensional state vector, $A$ is a $p \times p$ matrix, and $b$, $c$ and $d$ 
are $p$-dimensional vectors. The initial condition $x_0$ is assumed to be a normal random 
vector, $x_0 \sim N(0, Q_0)$; $\xi(t)$ is a Wiener process, and $\operatorname{var}\dot{\xi}(t) = \delta(t)$. 
The time delay is defined as follows: 

$$\tau = lh - h + \varepsilon \tag{6}$$

where $l \geq 1$ and $0 < \varepsilon \leq h$. The plant is controlled by the output $u(t)$ of a ZOH 
device with period $h$ 

$$u(t) = u_k, \quad t \in (kh, kh+h], \quad k = 0, 1, \dots, \tag{7}$$

driven by the digital controller output $u_k$, which changes its value at the discrete time 
instants $t_k = kh$, $k = 0, 1, \dots$. The output of the system is assumed to be measured 
synchronously at the instants $t_k$ as: 

$$y_k = d'x_k + n_k \tag{8}$$

where $n_k$ is the measurement error, a white noise with zero mean, $E[n_k] = 0$, and 
variance $E[n_k^2] = v^2$. The variance of the measurement noise characterizes the accuracy of 
the sensor, transmitter and A/D converter. 
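
To see what the sampled plant looks like in the simplest case, the first-order lag-delay model (3) can be discretized exactly under a zero-order hold. The Python sketch below is an illustration under stated assumptions: the state-space realization $\dot{x} = -x/T + (k/T)u$, $y = x$ is chosen here, the delay is taken as an integer number of sampling periods, the noise terms are omitted, and all function and parameter names are hypothetical.

```python
import math


def discretize_lag(gain, time_constant, h):
    """Exact ZOH discretization of gain/(time_constant*s + 1) with sampling period h."""
    F = math.exp(-h / time_constant)   # e^{Ah} with A = -1/time_constant
    g = gain * (1.0 - F)               # integral of e^{A v} b over one period, b = gain/time_constant
    return F, g


def step_response(gain, time_constant, h, delay_samples, n):
    """Sampled output for a unit step in u_k applied at k = 0, delayed by delay_samples periods."""
    F, g = discretize_lag(gain, time_constant, h)
    x = 0.0
    pending = [0.0] * delay_samples    # inputs that have not yet reached the plant
    y = []
    for _ in range(n):
        y.append(x)                    # noise-free measurement y_k = x_k
        pending.append(1.0)            # controller output u_k = 1
        x = F * x + g * pending.pop(0) # plant is driven by the delayed input
    return y


response = step_response(gain=2.0, time_constant=1.0, h=0.1, delay_samples=2, n=200)
```

The output stays at zero for the duration of the delay plus one sampling period and then approaches the static gain, which is the behaviour that the delay index $l$ in the benchmark is meant to capture.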

The aim of the system is to minimize the average value of the system error variance subject 
to a limit on the control variance. The considered problem is equivalent to the minimization 
of the weighted $H_2$ performance index: 

$$I = \lim_{N\to\infty} E\left\{\frac{1}{Nh}\int_0^{Nh}\left\{y^2(t+\tau) + \lambda u^2(t)\right\}dt\right\} \tag{9}$$



This index with controlled output y(t) and control signal u(t) is minimized both for optimal 
unrestricted LQG and classical restricted structure PID controllers (Grimble 2003) such that 
the maximum efficiency in terms of disturbance attenuation is achieved under bounded 
control variance. 

4. Control algorithms 

4.1 LQG benchmark 

Introduce the predictable state

$$x^p(t) = x(t+\tau) \tag{10}$$

The system defined by state equation (4), measurement equation (8), modulation equation (7) and performance index (9) can be described at sampling instants as

$$x^p_{k+1} = F x^p_k + g u_k + w^p_k \tag{11}$$

$$z_k = d' x^p(kh-\tau) + n_k \tag{12}$$

$$I = \lim_{N \to \infty} E\left\{ \frac{1}{N} \sum_{k=0}^{N-1} \left[ x^{p\prime}_k Q_1 x^p_k + 2 x^{p\prime}_k q_{12} u_k + q_2 u_k^2 + q_w \right] \right\} \tag{13}$$



where $w^p_k = w(kh+\tau)$ is a zero mean Gaussian white noise vector with covariance $E\{w^p_k w^{p\prime}_k\}$. Vectors $x_0$ and $[w^p_k, n_k]$ are uncorrelated for all $k \ge 0$. Matrices defining the system (11)-(12) are as follows

$$F(h) = e^{Ah}, \qquad w(h) = \int_0^h e^{A(h-s)} c\,\xi(s)\,ds$$

$$g(h) = \int_0^h e^{Av} b\,dv, \qquad W(h) = \int_0^h e^{As} c c' e^{A's}\,ds$$
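As a quick sanity check of the definitions of F(h), g(h) and W(h), the integrals have closed forms in the scalar case (p = 1). The sketch below is only an illustration under that assumption; the function name `discretize_scalar` and the numerical values are hypothetical and do not come from the chapter.

```python
import math

def discretize_scalar(a, b, c, h):
    """Scalar (p = 1) instance of F(h), g(h), W(h) for dx/dt = a*x + b*u + c*xi, a != 0."""
    F = math.exp(a * h)                                   # F(h) = e^{a h}
    g = b * (math.exp(a * h) - 1.0) / a                   # integral_0^h e^{a v} b dv
    W = c * c * (math.exp(2.0 * a * h) - 1.0) / (2.0 * a) # integral_0^h e^{a s} c c e^{a s} ds
    return F, g, W

# Hypothetical stable first-order lag sampled with h = 0.1
F, g, W = discretize_scalar(a=-1.0, b=1.0, c=1.0, h=0.1)
print(F, g, W)
```

For a matrix A the same quantities are usually obtained from matrix exponentials; the scalar version is kept here so that the formulas can be checked by hand.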

and matrices defining performance criterion (13) are as follows

$$q_{12} = \frac{1}{h}\int_0^h F'(\tau) M g(\tau)\,d\tau, \qquad q_2 = \frac{1}{h}\int_0^h g'(\tau) M g(\tau)\,d\tau, \tag{14}$$

$$q_w = \frac{1}{h}\, d' \left[ \int_0^h \int_0^{\tau} F(\tau-s)\, c c'\, F'(\tau-s)\,ds\,d\tau \right] d \tag{15}$$

The optimal control law minimizing the performance index (13) for the system (11)-(12) is



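$$u_k = -k_c' \hat{x}^p_{k|k} = -k_c' F_p \begin{bmatrix} \hat{x}_{k|k} \\ u_{k-l} \\ \vdots \\ u_{k-2} \\ u_{k-1} \end{bmatrix} = -k_c' F_p \bar{x}_{k|k} \tag{16}$$

where

$$F_p = \begin{bmatrix} F(\tau) & F(\tau-\theta) g(\theta) & F(lh-2h) g & \cdots & Fg & g \end{bmatrix} \tag{17}$$

and $\hat{x}^p_{k|k}$ is an estimate of the state $x^p_k$ using measurements $z_k$ up to and including $k$. The feedback gain vector depends on the positive solution $S$ of the following algebraic Riccati equation

$$S = Q_1 + F'SF - \frac{(q_{12} + F'Sg)(q_{12} + F'Sg)'}{q_2 + g'Sg} \tag{18}$$

$$k_c = \frac{q_{12} + F'Sg}{q_2 + g'Sg} \tag{19}$$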
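Equations (18)-(19) can be solved numerically by iterating the Riccati recursion until it settles. The following is a minimal scalar sketch under the assumption p = 1, so that all quantities are numbers; the function name `lqg_gain_scalar` and the numerical values are illustrative assumptions, not data from the chapter.

```python
import math

def lqg_gain_scalar(F, g, Q1, q12, q2, iters=10000, tol=1e-12):
    """Fixed-point iteration of the scalar form of (18); returns S and the gain k_c of (19)."""
    S = Q1
    for _ in range(iters):
        S_new = Q1 + F * S * F - (q12 + F * S * g) ** 2 / (q2 + g * S * g)
        converged = abs(S_new - S) < tol
        S = S_new
        if converged:
            break
    k_c = (q12 + F * S * g) / (q2 + g * S * g)
    return S, k_c

# Hypothetical values: F = 0.9, g = 0.1, Q1 = 1, q12 = 0, q2 = 0.01
S, k_c = lqg_gain_scalar(F=0.9, g=0.1, Q1=1.0, q12=0.0, q2=0.01)
closed_loop_pole = 0.9 - 0.1 * k_c
print(S, k_c, closed_loop_pole)
```

For these values the Riccati equation reduces to S^2 - 0.81 S - 1 = 0, which gives a hand-checkable reference for the iteration.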



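The stationary Kalman filter (Åström & Wittenmark 1990) for the system (11)-(12) takes the following form

$$\hat{x}_{k|k} = \hat{x}_{k|k-1} + k_f \left( z_k - d' \hat{x}_{k|k-1} \right) \tag{20}$$

$$\hat{x}_{k+1|k} = F \hat{x}_{k|k} + g(h-\theta)\, u_{k-l+1} + F(h-\theta) g(\theta)\, u_{k-l} \tag{21}$$

where

$$k_f = \frac{\Sigma d}{v^2 + d' \Sigma d} \tag{22}$$

$$\Sigma = W + F \left( \Sigma - \frac{\Sigma d d' \Sigma}{v^2 + d' \Sigma d} \right) F' \tag{23}$$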
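The filter gain of (22) follows from the stationary solution of (23), which can again be found by simple iteration. The sketch below assumes a scalar state (p = 1); the name `kalman_gain_scalar` and the numbers are hypothetical.

```python
def kalman_gain_scalar(F, d, W, v2, iters=10000, tol=1e-14):
    """Iterate the scalar form of (23) to its fixed point; return Sigma and k_f of (22)."""
    Sigma = W
    for _ in range(iters):
        Sigma_new = W + F * (Sigma - (Sigma * d) ** 2 / (v2 + d * Sigma * d)) * F
        converged = abs(Sigma_new - Sigma) < tol
        Sigma = Sigma_new
        if converged:
            break
    k_f = Sigma * d / (v2 + d * Sigma * d)
    return Sigma, k_f

# Hypothetical values: process noise W = 0.1, measurement noise variance v^2 = 0.01
Sigma, k_f = kalman_gain_scalar(F=0.9, d=1.0, W=0.1, v2=0.01)
residual = Sigma - (0.1 + 0.9 * (Sigma - Sigma ** 2 / (0.01 + Sigma)) * 0.9)
print(Sigma, k_f, residual)
```

With a small measurement noise variance the gain k_f approaches 1, i.e. the filter trusts the measurement almost completely.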



4.2 PID type controllers 

The system (4)-(5) controlled by means of discrete-time classical PID type controllers is considered. Controller settings are supplied in two ways: as the result of minimizing the performance index (13), and by means of one of the classical methods called QDR (Quarter Decay Ratio).

The control law for classical controllers is defined by the following equations

• state equation

$$x^c_{k+1} = A_c x^c_k - B_c z_k \tag{24}$$

• and output equation

$$u_k = C_c x^c_k - D_c z_k, \tag{25}$$



where, for the discrete-time PID controller, the matrices and vectors $A_c$, $B_c$, $C_c$ and $D_c$ defining the control law take the following form

[Equation (26): matrices $A_c$, $B_c$, $C_c$, $D_c$]



Since the predictable description of the system (11)-(12) is not useful for the PID control algorithm, the system defined by state equation (4), measurement equation (8), modulation equation (7) and performance index (9) is described at sampling instants in the alternative form

$$x_{k+1} = F x_k + \Gamma_0 u_{k-l+1} + \Gamma_1 u_{k-l} + w_k \tag{27}$$

$$z_k = d' x_k + n_k \tag{28}$$

where

$$\Gamma_1 = e^{A(h-\theta)} \int_0^{\theta} e^{Av} b\,dv = F(h-\theta) g(\theta), \qquad \Gamma_0 = \int_0^{h-\theta} e^{Av} b\,dv = g(h-\theta) \tag{29}$$

The corresponding state-space description is as follows

[Equation (30)]



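Introduce the notation

$$P = E\{x_k x_k'\} = \begin{bmatrix} P_{11} & P_{12} \\ P_{21} & P_{22} \end{bmatrix} \tag{31}$$

Employing (31) and (25), the performance index (13) can be rewritten as

$$\begin{aligned} I = {} & \operatorname{tr}\{Q_1 F_p P_{11} F_p' + Q_1 W(\tau)\} - \operatorname{tr}\{F_p' q_{12} D_c d' P_{11}\} \\ & - \operatorname{tr}\{d D_c' q_{12}' F_p P_{11}\} + \operatorname{tr}\{q_2 d D_c' D_c d' P_{11}\} \\ & + \operatorname{tr}\{C_c' q_{12}' F_p P_{12}\} - \operatorname{tr}\{q_2 C_c' D_c d' P_{12}\} \\ & + \operatorname{tr}\{F_p' q_{12} C_c P_{21}\} - \operatorname{tr}\{q_2 d D_c' C_c P_{21}\} \\ & + \operatorname{tr}\{q_2 C_c' C_c P_{22}\} + \operatorname{tr}\{q_2 D_c' D_c v^2 + q_w\} \end{aligned} \tag{32}$$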



4.3 Output and control variances 

The output and control variances at sampling instants for the LQG controller can be calculated from the following expressions

$$\sigma^2_{y,\mathrm{LQG}} = d' E\{x_k x_k'\} d = d' X(11)\, d \tag{33}$$

$$\sigma^2_{u,\mathrm{LQG}} = k_c' F_p E\{\hat{x}_{k|k} \hat{x}_{k|k}'\} F_p' k_c = k_c' F_p X(22) F_p' k_c \tag{34}$$

[Equation (35)]

and the formulas in the case of PID type controllers are defined as follows

$$\sigma^2_{y,\mathrm{PID}} = \operatorname{var}\{y_k\} = d' P_{11} d \tag{36}$$

$$\sigma^2_{u,\mathrm{PID}} = \operatorname{var}\{u_k\} = C_c P_{22} C_c' - D_c d' P_{12} C_c' - C_c P_{21} d D_c' + D_c d' P_{11} d D_c' + D_c v^2 D_c' \tag{37}$$



5. Trade-off curves 

Relationships between such criteria as output and control signal variances, drawn in two-dimensional space, determine the trade-off curve (Huang & Shah 1999), which separates two regions: achievable (above) and non-achievable (below). The control performance can be assessed from the location of the operating point with respect to the trade-off curve. The trade-off curve can thus be defined by means of the benchmark which minimizes the quadratic performance index in the form of (9). This means that such a benchmark acts as the lower bound taking the control effort into account.

Since standard deviations better characterize signal magnitudes, in the chapter trade-off curves are drawn on the plane std(y)-std(u).

Therefore the LQG control benchmark for a linear continuous-time plant, whose output is corrupted by a stochastic disturbance and which is controlled by a discrete-time controller, is proposed. The quality of control systems with PID controllers tuned both classically (QDR, Quarter Decay Ratio) and optimally, in such a way that disturbance characteristics are taken into account, is investigated and, assuming the same control effort, compared with the benchmark. It has been shown that optimal tuning of classical PID controllers improves the disturbance attenuation, bringing it closer to the lower bound.
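The way a trade-off curve arises from sweeping the weighting factor λ can be illustrated on a deliberately simplified case. The sketch below assumes a scalar discrete-time plant x_{k+1} = F x_k + g u_k + w_k with noise-free state feedback u_k = -k x_k and the cost E{x^2 + λ u^2}; this is a stand-in for the full LQG benchmark of the chapter, and the function name and numbers are hypothetical.

```python
import math

def tradeoff_point(F, g, W, lam, iters=100000, tol=1e-12):
    """Return (std_y, std_u) for the optimal state feedback at weighting lam."""
    S = 1.0
    for _ in range(iters):
        S_new = 1.0 + F * S * F - (F * S * g) ** 2 / (lam + g * S * g)
        converged = abs(S_new - S) < tol
        S = S_new
        if converged:
            break
    k = F * S * g / (lam + g * S * g)   # optimal feedback gain
    phi = F - g * k                     # closed-loop pole
    var_x = W / (1.0 - phi * phi)       # stationary variance of x_{k+1} = phi*x_k + w_k
    std_y = math.sqrt(var_x)
    std_u = abs(k) * std_y
    return std_y, std_u

lams = (0.001, 0.01, 0.1, 1.0, 10.0)
curve = [(lam,) + tradeoff_point(0.9, 0.1, 0.01, lam) for lam in lams]
for lam, std_y, std_u in curve:
    print(f"lambda={lam:<6} std(y)={std_y:.4f} std(u)={std_u:.4f}")
```

Each point is optimal for its λ, so moving along the curve trades a smaller std(u) for a larger std(y); points below the curve cannot be reached by any controller in this simplified setting.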




Fig. 1. Trade-off curve (the region below the curve is the non-achievable area)

In Fig. 2 the standard deviation of the output signal is plotted against the standard deviation of the control signal for systems with LQG and optimally tuned PID type controllers. Optimal values of these parameters were obtained for varying values of the weighting factor λ. Results of the minimum variance strategy are plotted as horizontal lines, and results of PID type control with controller settings supplied by means of the QDR method are plotted as points.



Fig. 2. Standard deviation of output vs control signal (trade-off curve); panels for h=0.1 and h=0.8; legend: LQG, PID, P, PID QDR, P QDR.

6. Control performance assessment based on lag-delay model

Suitability of control quality assessment based on delay approximation for delay-free plants is investigated in this section. To this end, the LQG control benchmark, which can be seen as an MV benchmark with bounded control variance, is compared for both linear delay-free continuous-time plants with outputs corrupted by a stochastic disturbance and their lag-delay models. Using approximated plant models, the area of achievable accuracy is then defined for control performance assessment.

The transfer function $G_p(s)$ is then approximated by the lag-delay transfer function $H_p(s)$ of (3). A comparison of their step responses is given in Fig. 3.

Fig. 3. Step response characteristics of original delay-free plant and its lag-delay model.

In Fig. 4 standard deviations of the output signal against standard deviations of the control signal for both delayed and non-delayed systems controlled by, respectively, optimal LQG and PID type controllers are plotted. The plots are parametrized by the weighting factor λ. Results of the MV (λ=0) strategy are plotted as horizontal dashed lines and define areas of uncertainty of the performance lower bound when using the lag-delay approximation of non-delayed systems. Results of the benchmark designed for the original model with realistic control signal magnitudes (λ=0.001) and of PID type control with controller settings supplied by means of QDR methods are plotted as points.

The MV control strategy used for the delayed model gives lower control quality than the 
MV algorithm for the delay-free model. Furthermore there is no significant improvement of 
control quality when optimal settings for classical controllers are used for the delayed model 
as compared to those obtained by means of QDR method. 

Since control signal magnitudes of the lower bound, i.e. LQG (λ=0) for the lag-delay approximation (3), and those achieved with the PID controller are comparable, the most popular MV benchmark η of (1) makes sense when using delayed models, indicating relatively good performance. Unfortunately, it can hide the possibility of further performance improvement when the delay-free model is used.

Fig. 4. Areas of achievable accuracy using lag-delay approximation.



Table 1 presents the values of the index η and the standard deviation of the output signal σ_y for the systems with the plant transfer functions $G_p(s)$ and $H_p(s)$. Furthermore, the value of sqrt(η)·σ_y is shown, which is equivalent to the standard deviation of the output signal obtained when the pure MV Åström's algorithm is used.



λ=0, h=0.1

Controller [plant]    η        σ_y      sqrt(η)   sqrt(η)·σ_y
LQG [Gp(s)]           0.8421   0.0751   0.9176    0.0689
PD [Gp(s)]            0.2668   0.1333   0.5166    0.0689
P [Gp(s)]             0.0701   0.2601   0.2648    0.0689
PID QDR [Gp(s)]       0.0480   0.3145   0.2088    0.0689
P QDR [Gp(s)]         0.0230   0.4545   0.1517    0.0689
LQG [Hp(s)]           0.9998   0.2026   0.9999    0.2026
PD [Hp(s)]            0.3465   0.3442   0.5886    0.2026
P [Hp(s)]             0.2938   0.3738   0.7362    0.2026
PID QDR [Hp(s)]       0.3485   0.3432   0.5652    0.2026
P QDR [Hp(s)]         0.1900   0.4649   0.4359    0.2026

Substitute delay
LQG [Gp(s)]           1.0000   0.0751   1.0000    0.0751
PD [Gp(s)]            0.4992   0.1333   0.7065    0.0942
P [Gp(s)]             0.4315   0.2601   0.6568    0.1709
PID QDR [Gp(s)]       0.2799   0.3145   0.5094    0.1664
P QDR [Gp(s)]         0.1773   0.4545   0.2791    0.1914

Table 1. Values of performance measure η and estimated output standard deviation under pure MV Åström's control algorithm

The bottom part of the table shows the values calculated for the delay-free system, $G_p(s)$, using the estimated substitute delay. The most important conclusion is that the use of the substitute delay to assess the system control performance often gives unreliable results. Another interesting observation is that the values from the last column belong to the uncertainty area of the performance lower bound (see Fig. 4). This can also be seen from Fig. 5, where the estimates of the performance lower bound are plotted for different assumed values of the delay $l$.
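The relation between the columns of Table 1 can be checked with a few lines of arithmetic. The sketch below assumes, as the text around the table suggests, that sqrt(η)·σ_y equals the output standard deviation under pure MV control, so that η = (σ_MV / σ_y)^2; the function name `performance_index` is hypothetical.

```python
def performance_index(sigma_mv, sigma_y):
    """Index eta as the ratio of the MV output variance to the actual output variance."""
    return (sigma_mv / sigma_y) ** 2

# Values taken from the first two rows of Table 1 (plant Gp(s), sigma_MV = 0.0689)
eta_lqg = performance_index(0.0689, 0.0751)
eta_pd = performance_index(0.0689, 0.1333)
print(round(eta_lqg, 4), round(eta_pd, 4))
```

Both values agree with the tabulated η (0.8421 and 0.2668) up to the rounding of the four-digit table entries.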




Fig. 5. Areas of achievable accuracy using lag-delay approximation (horizontal axis: substitute delay l).



7. More complete characteristic of control error 

As mentioned in the previous section, using the more sophisticated original delay-free model, additional improvement of control quality can be attained for both PID and LQG controllers. The price paid is a much larger control variance. It is interesting to note that while LQG systems remain robust, this is no longer valid for PID controllers.

In this section a comparison of certain time and frequency domain functions is therefore made to give further insight into the assessment of control performance in terms of system robustness.

The notion of 1D-PID will also be used to denote the limited authority tuning of a PID controller whose dynamical parameters $T_i$ and $T_d$ are chosen from a popular tuning rule, e.g. the QDR method based on model (3), and remain constant, while only the gain $K_p$ is chosen so as to minimize the index in (9).

In Fig. 6 trade-off curves displaying standard deviations of output signals against standard deviations of control signals for the original delay-free system controlled by the optimal LQG controller and by the optimally tuned 1D-PID controller are plotted. Plots are parameterized by the weighting factor λ, and results of the unrestricted MV-LQG and restricted structure 1D-PID MV strategies (λ=0) are plotted as horizontal dotted lines for both types of controllers. Due to excessive control actions and a small increase of control quality, MV based benchmarks are not particularly authoritative [3] for non-delayed systems. Therefore, a benchmark which assumes restricted control effort seems to be more suitable for systems controlled by PID controllers. Points a→a1, b→b1, c→c1 depict the correspondence of control systems with the same control effort.





Point   η        sqrt(η)
a       0.4846   0.6961
b       0.3075   0.5545
c       0.3018   0.5494
a1      0.7932   0.8906
b1      0.7071   0.8409
c1      0.5853   0.7650

Table 2. Performance measure for systems in Fig. 6

It is worth noting that for the MV-LQG η=0.9321 (sqrt(η)=0.9655). The almost MV-LQG system represented as a1 has the value η=0.7932 (sqrt(η)=0.8906), and for a reasonably tuned system represented by c1, η=0.5853 (sqrt(η)=0.7650).

Fig. 6. Standard deviation of output vs control signal for PID and LQG controlled systems.

In Fig. 7-8 the PSD (power spectral density) functions of output signals of the corresponding systems are plotted and compared with the PSD function of the disturbance. An important observation is that the optimal PID controller distinguishes itself by very poor robustness, both in terms of phase margin and a large sensitivity peak. This fact is reflected by the appearance of a high frequency peak in the PSD function. This is not the case for moderately tuned PID and all LQG controllers.

Fig. 7. PSD functions of output signals for LQG and PID loops in comparison with the PSD function of the disturbance, corresponding to a, a1, b, b1.

Fig. 8. PSD functions of output signals for LQG and PID loops in comparison with the PSD function of the disturbance, corresponding to c, c1.

Fig. 9. Nyquist plots of the open loop and the sensitivity function for PID control corresponding to a, b, c.

In Fig. 9-11 Nyquist plots, step responses and sensitivity functions of systems corresponding to points a, b, c and a1, b1, c1 are plotted. The main outcome is that similar efficiency of disturbance attenuation can be attained with both MV-LQG and 1D-PID controllers. Unfortunately, in contrast to LQG, increasing control efficiency results in poorer robustness of 1D-PID MV control systems.

Fig. 10. Nyquist plots of the open loop and the sensitivity function for LQG control corresponding to a1, b1, c1.

Fig. 11. Step response characteristics of LQG and PID loops corresponding to a, b, c and a1, b1, c1.



8. Remarks on single-stage against multi-stage performance criteria for 
control benchmarking 

Performance measures which take control effort into account are becoming more and more popular. Such a measure results in the solution of a GMVC or LQG problem, which in turn leads to the minimization of single-stage or computationally more complicated multi-stage quadratic performance criteria. This section shows that simple single-stage performance criteria have limited application for unstable and non-minimum-phase plants. It is further shown that for sampled-data systems an integral performance criterion which takes inter-sample behaviour into account is more suitable.

Single-stage and infinite-horizon performance criteria are compared with regard to their use for control system benchmarking. It will be shown that, when using a simple single-stage cost function, a critical value of control weighting might exist at which the control system loses its stability. As far as the infinite horizon is concerned, this problem does not exist. Furthermore, if the uncontrolled system has unstable discretization zeros, then even a single-step cost function can assure stability of the closed loop, provided that the single-stage discrete-time performance index is produced by integrating a continuous-time index within the sampling period.

To compare the system behaviour under single-stage and multi-stage criteria, standard deviations of output and control signals at sampling instants will be used again. These parameters describe signal magnitudes better than variances and are also useful to assess the control performance.

If the uncontrolled system is unstable or non-minimum-phase, the controller designed by means of a single-stage criterion can also be unstable. This is presented in Fig. 12 and Fig. 13 by means of values of standard deviations of output and control signals.

The exemplary uncontrolled first-order unstable system is given by the following transfer function:



$$G_1(s) = \frac{1}{s-1} \tag{38}$$
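The existence of a critical weighting for a single-stage criterion can be reproduced on a toy version of this plant. The sketch below assumes a first-order unstable plant 1/(s-1) sampled with a ZOH (a = e^h, b = e^h - 1), noise-free state feedback, a single-stage cost x_{k+1}^2 + λ u_k^2 and, for comparison, an infinite-horizon cost with the same weights. The sampling period h = 0.5 and all function names are hypothetical, and the chapter's own criteria may differ in detail.

```python
import math

def single_stage_pole(a, b, lam):
    """Closed-loop pole when u_k minimizes (a*x_k + b*u_k)^2 + lam*u_k^2."""
    k = a * b / (b * b + lam)
    return a - b * k                      # equals a*lam / (b^2 + lam)

def multi_stage_pole(a, b, lam, iters=2000):
    """Closed-loop pole for the infinite-horizon cost sum(x_k^2 + lam*u_k^2)."""
    S = 1.0
    for _ in range(iters):                # scalar Riccati recursion
        S = 1.0 + a * a * S - (a * b * S) ** 2 / (lam + b * b * S)
    k = a * b * S / (lam + b * b * S)
    return a - b * k

h = 0.5
a, b = math.exp(h), math.exp(h) - 1.0
lam_crit = b * b / (a - 1.0)              # single-stage loop is unstable for lam > lam_crit
for lam in (0.1, 0.5, 1.0, 10.0):
    print(lam, round(single_stage_pole(a, b, lam), 4), round(multi_stage_pole(a, b, lam), 4))
```

In this toy setting the single-stage pole a·λ/(b^2 + λ) leaves the unit circle once λ exceeds b^2/(a - 1) = e^h - 1, while the infinite-horizon design stays stable for every λ > 0.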



Fig. 12. Standard deviations of output and control signals against weighting factor λ, for both single-stage and multi-stage criteria (1st order unstable system).

Vertical dashed lines represent critical values of the weighting factor at which the closed loop system controlled by the algorithm based on the single-stage criterion becomes unstable. This does not happen for the more complicated control algorithm which minimizes the multi-stage performance function. Fig. 13 presents results for an exemplary non-minimum-phase plant described by the following transfer function:



$$G_2(s) = \frac{1}{s+1} \cdot \frac{1-as}{1+as}, \qquad a = 0.2 \tag{39}$$



The closed loop system with the controller designed by optimizing the multi-stage performance index remains stable for all values of λ, in contrast to the simple single-stage index.

Fig. 12. Standard deviations of output and control signals against weighting factor λ, for both single-stage and multi-stage criteria.

8.1 Integral cost function 

The next figures illustrate the use of the discrete versus the integral cost function for the plant given by the following transfer function

$$G_3(s) = \frac{1}{s+1} e^{-s\tau}, \qquad \tau = 0.7, \qquad h = 0.8 \tag{40}$$

Discretization of the transfer function $G_3(s)$ with period $h$ gives unstable zeros. In contrast to the discrete cost function, the integral criterion, both multi-stage and single-stage, provides stable results for the closed loop system.

Fig. 13. Standard deviations of output and control signals against weighting factor λ, for both single-stage and multi-stage criteria (discrete index).

Fig. 14. Standard deviations of output and control signals against weighting factor λ for both single-stage and multi-stage criteria (integral index).
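The unstable discretization zero mentioned above can be verified numerically. The sketch below assumes that the plant is a first-order lag 1/(s+1) with dead time τ = 0.7 sampled with h = 0.8, splits the delay as in (6), and evaluates Γ_0 and Γ_1 of (29) in scalar form; the helper names `delay_split` and `sampled_zero` are hypothetical.

```python
import math

def delay_split(tau, h):
    """Split tau = l*h - h + theta with l >= 1 and 0 < theta <= h, as in (6)."""
    l = math.ceil(tau / h)
    theta = tau - (l - 1) * h
    return l, theta

def sampled_zero(a, b, h, theta):
    """Zero of the pulse transfer function of x' = a*x + b*u(t - tau), scalar case."""
    g = lambda t: b * (math.exp(a * t) - 1.0) / a   # g(t) = integral_0^t e^{a v} b dv
    gamma0 = g(h - theta)                           # Gamma_0 = g(h - theta)
    gamma1 = math.exp(a * (h - theta)) * g(theta)   # Gamma_1 = F(h - theta) g(theta)
    return -gamma1 / gamma0                         # root of Gamma_0*z + Gamma_1

l, theta = delay_split(0.7, 0.8)
zero = sampled_zero(a=-1.0, b=1.0, h=0.8, theta=theta)
print(l, theta, round(zero, 4))
```

The zero lies well outside the unit circle (about -4.79), which is why a criterion that only weights the output at sampling instants runs into trouble for this plant.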



9. Conclusion 

In the chapter some developments in control performance assessment are provided. A solution based on quadratic performance criteria which take control effort into account was proposed in place of the popular MV measure. This in turn brought about the definition of the trade-off curve using standard deviations of both control and error signals. The standard deviation is preferred because it characterizes the signal magnitudes better than the variance. The tool in the form of the trade-off curve proved to be very useful for the control quality assessment of systems equipped with restricted structure controllers, such as the PID type. The area of achievable accuracy, when models with substitute delay are used, was also defined, showing the possibility of further improvement of control performance when a more sophisticated non-delayed model of the plant is applied. The price paid is a much larger control variance and loss of robustness in the case of systems with PID type controllers. Therefore, from the technological point of view, knowledge of the control error variance alone is not sufficient, and a more complete characteristic which gives further insight into control performance assessment in terms of robustness is necessary. Finally, the problem of using simple single-stage and computationally more complicated multi-stage quadratic performance criteria was exemplified. The existence of a critical value of the weighting factor was pointed out, which results in an unstable controller design if the uncontrolled system is unstable or non-minimum-phase and the simple single-stage cost function is used.

1. Introduction

Time series models (also called output or signal models), considered in this chapter, are functions of accessible process outputs, observed as a set of uniformly sampled data, which are the only information on the process itself. They are mainly applied in signal modeling and prediction. Stochastic time series models, i.e. models that use a white noise series as a part of the model, have been used in signal analysis since the 1960s. Time series modeling consists in fitting a function $f(\cdot)$ to a given data set $\{y_i\}$. The function $f(\cdot)$ is a function of previous data $y_{i-j}$, for $j=1,\ldots,J$, and an innovation series $w_{i-k}$, for $k=1,\ldots,K$. In general, a stochastic time series model has the following form:

$$y_i = f(y_{i-j}, w_{i-k}) + w_i \tag{1}$$

The innovation series $w_i$ is usually assumed to be either a white noise series $e_i$, or a series of model errors $\varepsilon_i = y_i - \hat{y}_i$. The function $f(\cdot)$ may be either linear or nonlinear. Though real processes are usually nonlinear and non-Gaussian, they are often modeled as linear ARMA (Box, 1983), (Yaffee, 2000):

$$A(D)\, y_i = C(D)\, e_i \tag{2}$$

where:

$e_i$ - Gaussian white noise series,

$D$ - time delay operator: $D^k y_i = y_{i-k}$.

The theory of Gaussian linear time series models, including stability and invertibility conditions, as well as analysis and identification methods, is well established, e.g. (Box, 1983). However, asymmetrical time series or time series that are characterized by data anomalies cannot be modeled as linear. There are a great number of possible nonlinear structures of the function $f(\cdot)$, but the most common one is a nonlinear polynomial structure (3). A model of this form is named a nonlinear ARMA model (NARMA).






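$$\begin{aligned} y_i = {} & w_i + \sum_{k=1}^{K} c_k w_{i-k} + \sum_{k_1=1}^{K}\sum_{k_2=1}^{K} c_{k_1 k_2} w_{i-k_1} w_{i-k_2} + \sum_{k_1=1}^{K}\sum_{k_2=1}^{K}\sum_{k_3=1}^{K} c_{k_1 k_2 k_3} w_{i-k_1} w_{i-k_2} w_{i-k_3} + \cdots \\ & + \sum_{j=1}^{J} a_j y_{i-j} + \sum_{j_1=1}^{J}\sum_{j_2=1}^{J} a_{j_1 j_2} y_{i-j_1} y_{i-j_2} + \sum_{j_1=1}^{J}\sum_{j_2=1}^{J}\sum_{j_3=1}^{J} a_{j_1 j_2 j_3} y_{i-j_1} y_{i-j_2} y_{i-j_3} + \cdots \end{aligned} \tag{3}$$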

Bilinear ARMA models are a subset of the class of NARMA models, and are described by the following equation:

$$y_i = w_i + \sum_{k=1}^{K} c_k w_{i-k} + \sum_{k=1}^{K}\sum_{l=1}^{L} \beta_{kl} w_{i-k} y_{i-l} + \sum_{j=1}^{J} a_j y_{i-j}. \tag{4}$$

Nonlinear time series analysis, particularly establishing stability and invertibility conditions, is in general much more complex than the analysis of linear ones. Therefore, only particular model structures are analyzed in practice. In 1978, Granger and Andersen derived some interesting properties of the bilinear model with the simplest structure (Granger & Andersen, 1978)

$$y_i = e_i + \beta_{11} e_{i-1} y_{i-1}, \tag{5}$$

where $e_i$ is an independent white noise sequence with zero mean and variance $m_e^{(2)}$.

Since then, simple bilinear models have also been investigated by Martins (Martins, 1997), (Martins, 1999), Berlin Wu (Berlin Wu, 1995), Tong (Tong 1993), Granger and Terasvirta (Granger & Terasvirta, 1993). Opinions on the usefulness of bilinear series vary from the skeptical "Using economic data, bilinear models have not been found to be very relevant" (Tong 1993) to the enthusiastic "The bilinear model has been used successfully to model time series that have been traditionally difficult to fit with classical linear time series methods" (Martins, 1999). The aim of the chapter is to take a position on the above statements, especially in the field of technological and medical processes. In the chapter, the elementary bilinear model EB(k,l):

$$y_i = w_i + \beta_{kl} w_{i-k} y_{i-l}, \tag{6}$$

where $k \le l$, is considered and then applied in signal analysis.

The chapter is organized in the following way:

Section 2 is dedicated to elementary bilinear processes. Analytical relations between process moments and process parameters are presented for diagonal and sub-diagonal elementary bilinear processes. In general, they are valid under the assumption that the inaccessible process input is uncorrelated and symmetrically distributed.

In Section 3, methods of parameter estimation for elementary bilinear models are presented. Identification algorithms for simple and generalized methods of moments for elementary bilinear models are formulated.

Section 4 is dedicated to the application of elementary bilinear models in simulation and prediction. A hybrid linear-bilinear model is introduced and, on its basis, a bilinear minimum-variance prediction algorithm is derived, for a model residuum represented by a diagonal and a sub-diagonal elementary bilinear model.

In Section 5, the most important results are summarized.

2. Bilinear time series models

A large number of dynamical systems may be described with a set of conservation equations in the following form:

$$\frac{dx(t)}{dt} = Ax(t) + Bu(t) + \sum_{k} N_k u_k(t) x(t), \tag{7}$$



where the last term creates the bilinear part of the equation. Bilinear equations are a natural way of describing a number of chemical technological processes like decantation, distillation, and extraction, as well as biomedical systems, e.g. (Mohler, 1999), (Nise, 2000). Though the nature of many processes is bilinear, identification of the model (7) can be difficult, if only because some of the state or input variables may be immeasurable. This is the case for many biological or biomedical processes. Often, the discrete set of output observations $\{y_i\}$, for $i=1,\ldots,n$, is the only information on the considered process. In such cases the bilinear time series model (8), which explains the relation between the output data only, may be considered.

$$A(z^{-1})\, y_i = C(z^{-1})\, e_i + \sum_{k=1}^{K}\sum_{l=1}^{L} \beta_{kl} e_{i-k} y_{i-l} \tag{8}$$

Bilinear time series models have been mentioned in control engineering since the early 1970s. The Schetzen theorem (Schetzen, 1980) states that any stable time-variant process may be modeled as a time-invariant bilinear time series. The general structure of the bilinear time series model (8) is complex enough to make its analysis very difficult. Therefore, in practice, particular model structures are analysed.

Stochastic processes are completely characterized by their probabilistic structure, i.e. probability or probability density $p(y)$ (e.g. Therrien, 1992). However, in practice, the probabilistic structure of a considered system is unknown and, therefore, the system analysis is performed on the basis of its statistical moments. The moments for any stochastic process with any probability density $p(y)$ are expressed as:

$$M_y^{(r)} = E\{y_i^r\} \tag{9}$$

where $E$ is the operator of expected value:

$$E\{y\} = \sum y\, p(y) = \mu. \tag{10}$$

Central moments are:

$$M_y'^{(r)} = E\{(y_i - \mu)^r\} \tag{11}$$

When the structure of a particular bilinear model is simple, the moments and the central moments may be analytically calculated based on the process equation and the moments' definitions (9), (11). Elementary bilinear time series models, considered in this chapter, are classified according to their structure as sub diagonal or diagonal.

2.1 Sub diagonal elementary bilinear time series EB(k,l)

When the structure (k,l) of the elementary bilinear time series model EB(k,l) satisfies the relation $k < l$, the model (12) is named sub diagonal.

$$y_i = e_i + \beta_{kl} e_{i-k} y_{i-l}. \tag{12}$$

The model is characterized by two parameters, $\beta_{kl}$ and $m_e^{(2)}$, related to each other. It may be proven (e.g. Tong, 1993) that the model (12) is stable when $|\beta_{kl}^2 m_e^{(2)}| < 1$, and is invertible when $|\beta_{kl}^2 m_e^{(2)}| < 0.5$. Time series invertibility means that for a stable time series

$$y_i = f(e_i, e_{i-1}, \ldots, e_{i-k}, y_{i-1}, \ldots, y_{i-l}) \tag{13}$$

the operation of inversion

$$e_i = f(e_{i-1}, \ldots, e_{i-k}, y_i, y_{i-1}, \ldots, y_{i-l}) \tag{14}$$

is stable. The moments and the central moments of EB(k,l) may be analytically calculated based on the process equation (12) and the moments' definitions (9), (11). Relations between moments and parameters are given in Table 1. The variance $M_y^{(2)}(0)$ of EB(k,l) is bounded when:

$$\beta_{kl}^2 m_e^{(2)} < 1. \tag{15}$$

The fourth moment $M_y^{(4)}(0,0,0)$ of EB(k,l) is bounded when:

$$\beta_{kl}^4 m_e^{(4)} < 1. \tag{16}$$

Irrespective of the probability density of $e_i$, the sub diagonal EB(k,l) is non-Gaussian and uncorrelated. The Gaussian equivalent of the sub diagonal EB(k,l) with a bounded variance is a Gaussian white noise with the first and the second moments the same as the respective moments of the EB(k,l). A comparison of an EB(2,4) time series and its Gaussian equivalent is shown in Fig. 1.
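The properties stated above (zero mean and bounded variance of a sub diagonal EB(k,l) series) can be checked by simulation. The sketch below assumes Gaussian e_i and uses the variance m_e^(2) / (1 - β_kl^2 m_e^(2)), which follows from squaring (12) and taking expectations under the independence of e_{i-k} and y_{i-l} for k < l. The function names and parameter values are hypothetical.

```python
import random

def simulate_eb(beta, k, l, m2, n, seed=0):
    """Simulate n samples of y_i = e_i + beta * e_{i-k} * y_{i-l} with Gaussian e_i."""
    rng = random.Random(seed)
    sd = m2 ** 0.5
    lag = max(k, l)
    e = [rng.gauss(0.0, sd) for _ in range(n + lag)]
    y = [0.0] * (n + lag)                 # zero initial conditions
    for i in range(lag, n + lag):
        y[i] = e[i] + beta * e[i - k] * y[i - l]
    return y[lag:]

def theoretical_variance(beta, m2):
    """Variance of sub diagonal EB(k,l), valid when beta^2 * m2 < 1."""
    return m2 / (1.0 - beta * beta * m2)

beta, m2 = 0.4, 1.0                       # beta^2 * m2 = 0.16 < 0.5: stable and invertible
y = simulate_eb(beta, k=2, l=4, m2=m2, n=200000)
mean = sum(y) / len(y)
var = sum((v - mean) ** 2 for v in y) / len(y)
print(round(mean, 4), round(var, 4), round(theoretical_variance(beta, m2), 4))
```

The sample mean stays close to zero and the sample variance close to 1/0.84 ≈ 1.19, although individual realizations show the occasional large peaks typical of bilinear series.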



Moments                                      Formulae
$M_y^{(1)}$                                  $0$
$M_y^{(2)}(0)$                               $\dfrac{m_e^{(2)}}{1 - \beta_{kl}^2 m_e^{(2)}}$
$M_y^{(2)}(m > 0)$                           $0$
$M_y^{(3)}(0,0)$                             $0$
$M_y^{(3)}(l_1 \ne k,\ l_2 \ne l)$           $0$
$M_y^{(3)}(k,l)$                             $\beta_{kl}\, m_e^{(2)} M_y^{(2)}(0)$
$M_y^{(4)}(0,0,0)$                           $\dfrac{m_e^{(4)} + 6 \beta_{kl}^2 (m_e^{(2)})^2 M_y^{(2)}(0)}{1 - \beta_{kl}^4 m_e^{(4)}}$

Table 1. Relation between moments and EB(k,l) parameters

Fig. 1. Comparison of the estimated moments of EB(2,4) and an equivalent white noise

2.2 Diagonal elementary bilinear time series EB(k,k)

The elementary diagonal bilinear time series model EB(k,k) has the following structure:

$$y_i = e_i + \beta_{kk} e_{i-k} y_{i-k}. \tag{17}$$

Properties of the model depend on two parameters, $\beta_{kk}$ and $m_e^{(2)}$, related to each other. Stability and invertibility conditions for EB(k,k) are the same as for the sub diagonal EB(k,l) time series model. Given the process equation (17) and the moments' definitions (9) and (11), moments and central moments of EB(k,k) may be analytically calculated as functions of the model parameters. Though EB(k,k) and EB(k,l) are similar to each other with respect to the model equation, their statistical characteristics are significantly different. Relations between succeeding moments and model parameters are given in Table 2. An example of a single realization of an EB(5,5) series as well as its sampled moments is shown in Fig. 2.

Table 2. Relations between moments and EB(k,k) parameters

Fig. 2. EB(5,5) sequence and its characteristics

The diagonal EB(k,k) time series $\{y_i\}$ has a non-zero mean value, equal to $M_y^{(1)}$. The deviation from the mean, $z_i = y_i - M_y^{(1)}$, is a non-Gaussian time series. A Gaussian equivalent of $z_i$ is an MA(k) series:

$$z_i = w_i + c_k w_{i-k} \tag{18}$$

where $w_i$ is a Gaussian white noise series. Values of $c_k$ and $m_w^{(2)}$ can be calculated from the set of equations (19):

$$\frac{m_e^{(2)}\left(1 + \beta_{kk}^2 m_e^{(2)} + \beta_{kk}^4 (m_e^{(2)})^2\right)}{1 - \beta_{kk}^2 m_e^{(2)}} = m_w^{(2)} (1 + c_k^2), \qquad \beta_{kk}^2 (m_e^{(2)})^2 = c_k\, m_w^{(2)} \tag{19}$$

3. Identification of EB(k,l) models 



Under the assumption that the model EB(k,l) is identifiable, and that the model structure is known, methods of estimation of the model parameters are similar to the methods of estimation of linear model parameters. The similarity stems from the fact that the bilinear model structure, though nonlinear in $e_i$ and $y_i$, is linear in the parameter $\beta_{kl}$. A number of estimation methods originate from minimization of a squared prediction error (20). Three of them, which are frequently applied in estimation of bilinear model parameters, will be discussed in section 3.1.

$$\varepsilon_i = y_i - \hat{y}_{i|i-1} \tag{20}$$

Moments' methods are an alternative way of parameter estimation. Model parameters are calculated on the basis of estimated stochastic moments (Tang & Mohler, 1988). Moments' methods are seldom applied, because analytical formulae connecting moments and model parameters are hardly ever known. For elementary bilinear time series models the formulae were derived (see Table 1, Table 2) and, therefore, the method of moments and the generalized method of moments, discussed in section 3.2, may be implemented to estimate the parameters of elementary bilinear models.

3.1 Methods originated from minimization of the squared prediction error 

Methods that originate from minimization of the squared prediction error (20) calculate model parameters by optimizing a criterion $J(\varepsilon_i^2)$, a function of the squared prediction error. In this section the following methods are discussed:

- minimization of sum of squares of prediction error, 

- maximum likelihood, 

- repeated residuum. 

a) Minimization of the sum of squares of prediction error 

Minimization of the sum of squares of the prediction error is one of the simplest and most frequently used methods for time series model identification. Unfortunately, the method is sensitive to anomalies in the data set used for identification (Dai & Sinha, 1989). In general, large deviations of the data from the normal course of the time series are filtered out (outliers are removed) before the model is identified. However, such filtration cannot be applied to bilinear time series, in which sudden and unexpected peaks follow from the nature of the bilinear process and should not be removed from the data set used for identification. Therefore, the basic LS algorithm cannot be applied to elementary bilinear model identification and should be replaced by a modified LS algorithm that is resistant to anomalies. Dai and Sinha proposed a robust recursive version (RLS) of the LS algorithm, in which the parameter $\beta_{kl}$ of the model EB(k,l) is calculated in the following way:

$$\begin{aligned}
b_{kl,i} &= b_{kl,i-1} + k_i\left(y_i - \phi_i\, b_{kl,i-1}\right),\\
k_i &= \frac{P_{i-1}\,\phi_i}{\alpha_i + \phi_i^2 P_{i-1}},\\
P_i &= \frac{1}{\alpha_i}\left(P_{i-1} - \frac{P_{i-1}^2\,\phi_i^2}{\alpha_i + \phi_i^2 P_{i-1}}\right),
\end{aligned} \tag{21}$$

where:

- $b_{kl,i}$: evaluation of the model parameter $\beta_{kl}$ calculated in the i-th iteration,

- $\phi_i = w_{i-k}\, y_{i-l}$: generalized input,

- $w_i = y_i - \phi_i\, b_{kl,i-1}$: one step ahead prediction error,

- $\alpha_i$: coefficient that depends upon the prediction error as follows:

$$\alpha_i = \begin{cases} \operatorname{sign}(w_i)\,\dfrac{y_{tresh}}{w_i} & \text{for } |w_i| > y_{tresh},\\[4pt] 1 & \text{for } |w_i| \le y_{tresh}, \end{cases}$$

- $y_{tresh}$: a threshold value.
b) Maximum likelihood 

The maximum likelihood method was first applied to bilinear model identification by Priestley (Priestley, 1980), then by Subba (Subba, 1981) and others, e.g. (Brunner & Hess, 1995). In this method, the elementary bilinear model EB(k,l) is represented as a function of two parameters, $y_{model}(b_{kl}, w_{i-k})$:

$$y_{model} = b_{kl}\, w_{i-k}\, y_{i-l} \tag{22}$$

where $w_i$ is an innovation series, equivalent to the model errors:

$$w_i = y_i - y_{model}(b_{kl}, w_{i-k}). \tag{23}$$

The likelihood is defined as:

$$L = L\left(b_{kl}, m_w^{(2)}\right) = \prod_{i=1}^{N} f\left(b_{kl}, m_w^{(2)}; w_i\right) \tag{24}$$

Maximization of L is equivalent to minimization of $-l = -\ln(L)$:

$$-l\left(b_{kl}, m_w^{(2)}\right) = -\sum_{i=1}^{N} \ln f\left(b_{kl}, m_w^{(2)}; w_i\right) \tag{25}$$

Assuming that $w_i$ is a Gaussian series with zero mean and variance $m_w^{(2)}$, the negative log-likelihood $-\ln(L)$ is:

$$-\ln(L) = -l\left(w_N, w_{N-1}, \ldots, w_1 \mid b_{kl}, m_w^{(2)}\right) = \frac{N}{2}\ln\left(2\pi m_w^{(2)}\right) + \sum_{i=1}^{N}\frac{w_i^2}{2 m_w^{(2)}} \tag{26}$$

Given initial values of $b_{kl}$ and $m_w^{(2)}$, the parameters $b_{kl}$ and $m_w^{(2)}$ are calculated by minimization of (26). The solution is obtained iteratively, using e.g. the Newton-Raphson method. The essential difficulty is that $w_i$ cannot be measured and, in each iteration, has to be calculated as:

$$w_i = y_i - b_{kl}\, w_{i-k}\, y_{i-l} \tag{27}$$

The obtained estimates of the EB(k,l) parameters are asymptotically unbiased if $w_i$ is Gaussian (Kramer & Rosenblatt, 1993). For other distributions, the Gaussian approximation of the probability density function $f\left(y_i - y_{model}(b_{kl}, w_{i-k})\right)$ causes the estimated parameters to be biased.
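The interplay between the recursion (27) and the criterion (26) can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the function names are hypothetical, the data are simulated, and a coarse grid search over $b_{kl}$ stands in for the Newton-Raphson iteration mentioned in the text. For each candidate $b_{kl}$ the variance is set to the mean squared innovation, which is the value that minimizes (26) for fixed $b_{kl}$.

```python
import math
import random


def simulate_eb(n, b, k, l, seed=0):
    """Simulate y_i = w_i + b * w_{i-k} * y_{i-l} with unit-variance Gaussian w_i."""
    rng = random.Random(seed)
    w = [rng.gauss(0.0, 1.0) for _ in range(n)]
    y = [0.0] * n
    for i in range(n):
        y[i] = w[i] + (b * w[i - k] * y[i - l] if i >= max(k, l) else 0.0)
    return y


def innovations(y, b, k, l):
    """Recover w_i recursively, as in (27)."""
    w = list(y)                      # first samples: no bilinear term yet
    for i in range(max(k, l), len(y)):
        w[i] = y[i] - b * w[i - k] * y[i - l]
    return w


def neg_log_likelihood(y, b, m_w2, k, l):
    """Gaussian negative log-likelihood (26)."""
    w = innovations(y, b, k, l)
    n = len(w)
    return 0.5 * n * math.log(2.0 * math.pi * m_w2) + sum(v * v for v in w) / (2.0 * m_w2)


def ml_estimate(y, k, l, grid=None):
    """Grid search over b (an assumed stand-in for Newton-Raphson)."""
    grid = grid or [j / 100.0 for j in range(-50, 51)]
    best = None
    for b in grid:
        w = innovations(y, b, k, l)
        m_w2 = sum(v * v for v in w) / len(w)   # minimizes (26) for this b
        nll = neg_log_likelihood(y, b, m_w2, k, l)
        if best is None or nll < best[0]:
            best = (nll, b, m_w2)
    return best[1], best[2]


y = simulate_eb(3000, 0.3, 1, 2, seed=1)
b_hat, m_hat = ml_estimate(y, 1, 2)
```

The grid is deliberately restricted to $|b_{kl}| \le 0.5$ so that the residual recursion stays well behaved for unit-variance noise; a gradient-based optimizer would need a similar safeguard.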

c) Repeated residuum method 

An alternative estimation method, named the repeated residuum method, is proposed in (Priestley, 1980). Applied to identification of elementary bilinear models, the method may be presented as the following sequence of steps:

1. The model EB(k,l) is expressed as:

$$y_i = \left(1 + b_{kl}\, y_{i-l}\, D^k\right) w_i \tag{28}$$

or equivalently:

$$w_i = \frac{y_i}{1 + b_{kl}\, y_{i-l}\, D^k} \tag{29}$$

where $D^k$ delays the series it acts on by k samples.

2. Assuming $b_{kl}$ small, (29) may be approximated by:

$$w_i = \left(1 - b_{kl}\, y_{i-l}\, D^k\right) y_i = y_i - b_{kl}\, y_{i-k}\, y_{i-l}. \tag{30}$$

Treating $w_i$ as an identification error, an initial estimate $b_{kl,0}$ of the parameter $b_{kl}$ can be evaluated from (30), using e.g. the LS method.

3. Next, starting from $b_{kl,0}$ and $w_0 = 0$, successive values of $w_i$ can be calculated iteratively:

$$w_i = y_i - b_{kl,0}\, w_{i-k}\, y_{i-l} \quad \text{for } i = k, k+1, \ldots, N. \tag{31}$$

4. Given $y_i$ and $w_i$ for i=k,...,N, an improved estimate of $b_{kl}$ that minimizes the following sum of squared errors (32) may be calculated.

$$V(b_{kl}) = \sum_{i=k}^{N} \left(y_i - b_{kl}\, w_{i-k}\, y_{i-l}\right)^2. \tag{32}$$

5. Steps 3 and 4 are repeated until the estimate settles at a steady value.
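The five steps above can be sketched as follows. This is a minimal illustration, not the authors' implementation: the function names, the zero initial residuals, the stopping tolerance and the simulated test series are all assumptions. Both least-squares problems have a single unknown, so each reduces to a ratio of two sums.

```python
import random


def simulate_eb(n, b, k, l, seed=0):
    """Simulate y_i = w_i + b * w_{i-k} * y_{i-l} with unit-variance Gaussian w_i."""
    rng = random.Random(seed)
    w = [rng.gauss(0.0, 1.0) for _ in range(n)]
    y = [0.0] * n
    for i in range(n):
        y[i] = w[i] + (b * w[i - k] * y[i - l] if i >= max(k, l) else 0.0)
    return y


def repeated_residuum(y, k, l, max_iter=50, tol=1e-8):
    n, s = len(y), max(k, l)
    # Step 2: initial LS estimate from the approximation (30)
    num = sum(y[i] * y[i - k] * y[i - l] for i in range(s, n))
    den = sum((y[i - k] * y[i - l]) ** 2 for i in range(s, n))
    b = num / den
    for _ in range(max_iter):
        # Step 3: residuals from (31), started from zeros (assumption)
        w = [0.0] * n
        for i in range(s, n):
            w[i] = y[i] - b * w[i - k] * y[i - l]
        # Step 4: LS minimizer of the sum of squares (32)
        num = sum(y[i] * w[i - k] * y[i - l] for i in range(s, n))
        den = sum((w[i - k] * y[i - l]) ** 2 for i in range(s, n))
        b_new = num / den
        # Step 5: stop when the estimate no longer changes
        if abs(b_new - b) < tol:
            return b_new
        b = b_new
    return b


y = simulate_eb(3000, 0.3, 1, 2, seed=3)
b_hat = repeated_residuum(y, 1, 2)
```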




3.2 Moments method 

For the group of methods that originate from minimization of the squared prediction error, precise forms of the estimation algorithms can be formulated. For the moments method, by contrast, only a general idea can be given; the details depend on the model type and the model structure. The moments method (MM) consists of two stages:

Stage 1: Under the assumption that the model structure is the same as the process structure, moments and central moments $M^{(r)}$ are presented as a function of the process parameters $\Theta$:

$$M^{(r)} = f(\Theta) \tag{33}$$

If possible, the moments are chosen such that the set of equations (33) has a unique solution.

Stage 2: In (33) the moments $M^{(r)}$ are replaced with their evaluations $\hat{M}^{(r)}$, estimated from the available data set $y_i$:

$$\hat{M}^{(r)} = f(\Theta) \tag{34}$$

The set of equations (34) is then solved with respect to the parameters $\Theta$. Taking into account the particular relations between moments and parameters for elementary bilinear models, the MM estimation algorithm can be proposed in a simple and a generalized version.
MM - simple version 
It is assumed that $w_i$ is a stochastic series, symmetrically distributed around zero, and that the even moments $m_w^{(2r)}$ satisfy the following relations:

$$m_w^{(2r)} = k_{2r}\left(m_w^{(2)}\right)^r \quad \text{for } r = 1, 2, 3, \ldots \tag{35}$$

Identification of EB(k,l) consists of identification of the model structure (k,l) and estimation of the parameters $b_{kl}$ and $m_w^{(2)}$. The identification algorithm is presented below as a sequence of steps:

1. Data analysis:

a. On the basis of the data set $\{y_i\}$ for i=1,...,N, estimate the following moments: $\hat{M}_y^{(1)}$; $\hat{M}_y^{(2)}(m)$ for m = 0,1,2,...; $\hat{M}_y^{(3)}(l_1, l_2)$ for $l_1, l_2 = 0,1,2,\ldots$; $\hat{M}_y^{(4)}(0,0,0)$.

b. Find the values of $l_1$ and $l_2$ ($l_1 \le l_2$) for which the absolute value of the third moment $\hat{M}_y^{(3)}(l_1, l_2)$ is maximal.

2. Structure identification:

a. If $l_1 = k$, $l_2 = l$ then the subdiagonal model EB(k,l) should be chosen.

b. If $l_1 = k$, $l_2 = k$ then the diagonal model EB(k,k) should be chosen.

3. Checking system identifiability condition:

If the model EB(k,l) was chosen, then:

a. Calculate the index

$$W_3 = \frac{\left(\hat{M}_y^{(3)}(k,l)\right)^2}{\left(\hat{M}_y^{(2)}(0)\right)^3} \tag{36}$$

b. If $W_3 > 0.25$, equation (39) has no real solution and it is impossible to find a bilinear model EB(k,l) that has the same statistical characteristics as the considered process. The nonlinear identification procedure should be stopped. In such a case either a linear model may be assumed, or another nonlinear model should be proposed.

If the model EB(k,k) was chosen, then:

a. Calculate the index

$$W_4 = \frac{\hat{M}_y^{(3)}(k,k)}{\hat{M}_y^{(2)}(0)\sqrt{\hat{M}_y^{(2)}(k)}} \tag{37}$$

b. If $\left|W_4 - \frac{3}{\sqrt{2}}\right| < \epsilon$, where $\epsilon$ is an assumed accuracy, then the model input may be assumed Gaussian:

i. Calculate the index

$$W_5 = \frac{\hat{M}_y^{(3)}(k,k)\,\hat{M}_y^{(2)}(k)}{\hat{M}_y^{(2)}(0)\,\hat{M}_y^{(3)}(0,0)} \tag{38}$$

ii. If $W_5 < 0.23$, then the model EB(k,k) with Gaussian input may be applied. If not, then the linear model MA(k) should be taken into account.

c. If $\left|W_4 - \frac{3}{\sqrt{2}}\right| > \epsilon$, then the model input $w_i$ cannot be Gaussian.



4. Estimation of model parameters:

a. When the model EB(k,l) was chosen in step 2:

i. Find the solutions $x_1, x_2$ of the equation

$$W_3 = x(1 - x), \tag{39}$$

where $x = b_{kl}^2\, m_w^{(2)}$.

ii. For each of the solutions $x_1, x_2$ calculate the model parameters from the following equations:

$$m_w^{(2)} = \hat{M}_y^{(2)}(0)\,(1 - x), \qquad b_{kl}^2 = \frac{x}{m_w^{(2)}}. \tag{40}$$

iii. In general, the model EB(k,l) is not parametrically identifiable, i.e. equations (39) and (40) have no unique solution. The decision on the final model parameters should depend on the intended use of the model. Models applied for control and prediction should be stable and invertible. Models used for simulation should be stable but do not have to be invertible.

b. When in step 2 the model EB(k,k) is chosen:

i. If $\left|W_4 - \frac{3}{\sqrt{2}}\right| > \epsilon$, then

$$x = \frac{k_4 - W_4\sqrt{2}}{W_4\sqrt{2}\,(k_4 - 1) - 2k_4},$$

where:

for $k_4 < 3$: $\frac{k_4\sqrt{2}}{2} < W_4 < \frac{3\sqrt{2}}{2}$,

for $k_4 > 3$: $\frac{3\sqrt{2}}{2} < W_4 < \frac{k_4\sqrt{2}}{2}$.

ii. If $W_4 \approx \frac{3}{\sqrt{2}}$, i.e. $w_i$ is Gaussian, then the following equation has to be solved:

$$W_5 = \frac{6x(1-x)}{3 + 2x + 22x^2}$$

Because the model EB(k,k) with Gaussian input is not parametrically identifiable, the final model should be chosen according to its intended use, taking into account the same circumstances as in paragraph a)-iii.
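For the subdiagonal case, steps 3 and 4a of the algorithm above can be sketched in Python as follows. The sketch rests on assumptions that should be checked against Tables 1 and 2: it takes $x = b_{kl}^2 m_w^{(2)}$, uses sample raw moments (EB(k,l) with k<l has zero mean), and fixes the sign of $b_{kl}$ from the sign of the third moment. All function names and the simulated series are hypothetical.

```python
import math
import random


def simulate_eb(n, b, k, l, seed=0):
    """Simulate y_i = w_i + b * w_{i-k} * y_{i-l} with unit-variance Gaussian w_i."""
    rng = random.Random(seed)
    w = [rng.gauss(0.0, 1.0) for _ in range(n)]
    y = [0.0] * n
    for i in range(n):
        y[i] = w[i] + (b * w[i - k] * y[i - l] if i >= max(k, l) else 0.0)
    return y


def moment2(y, lag=0):
    """Sample second moment at the given lag."""
    n = len(y)
    return sum(y[i] * y[i - lag] for i in range(lag, n)) / (n - lag)


def moment3(y, l1, l2):
    """Sample third moment at lags (l1, l2)."""
    s, n = max(l1, l2), len(y)
    return sum(y[i] * y[i - l1] * y[i - l2] for i in range(s, n)) / (n - s)


def simple_mm_subdiagonal(y, k, l):
    """Return the candidate (b_kl, m_w2) pairs, or [] if W3 > 0.25."""
    m2 = moment2(y)
    m3 = moment3(y, k, l)
    w3 = m3 * m3 / m2 ** 3                 # index (36)
    if w3 > 0.25:
        return []                          # (39) has no real solution
    root = math.sqrt(1.0 - 4.0 * w3)
    candidates = []
    for x in ((1.0 - root) / 2.0, (1.0 + root) / 2.0):   # roots of (39)
        m_w2 = m2 * (1.0 - x)              # first equation of (40)
        if m_w2 <= 0.0:
            continue
        # Assumption: the sign of b follows the sign of the third moment
        b = math.copysign(math.sqrt(x / m_w2), m3)
        candidates.append((b, m_w2))
    return candidates


y = simulate_eb(20000, 0.3, 1, 2, seed=2)
solutions = simple_mm_subdiagonal(y, 1, 2)
```

The first returned pair corresponds to the smaller root of (39); the second pair illustrates the lack of parametric identifiability noted in step 4a-iii.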

MM — generalized version: 

The generalized moments method (GMM) (Gourieroux et al., 1996), (Bond et al., 2001), (Faff & Gray, 2006) is a numerical method in which model parameters are calculated by minimization of the following index:

$$I = \sum_{j=1}^{J} f_j(y_i, \Theta)^2, \tag{41}$$

where:

$\Theta$: vector of parameters,

$f_j(y_i, \Theta)$: a function of the data $y_i$ and the parameters $\Theta$, for which:

$$E\{f_j(y_i, \Theta)\} = 0 \quad \text{when } \Theta = \Theta_0, \tag{42}$$

$\hat{\Theta}$: vector of parameters minimizing the index I.

The function $f_j(y_i, \Theta)$ for j=1,2,...,J is defined as the difference between the analytical moment $M^{(j)}(\Theta)$, dependent upon the parameters $\Theta$, and its evaluation $\hat{M}^{(j)}$ calculated on the basis of $y_i$ for i=1,...,N. The number J of considered moments depends on the model being identified.

Identification of the subdiagonal elementary bilinear model EB(k,l) makes use of the four moments. The functions $f_j$ for j=1,...,4 are defined in the following way:

$$\begin{aligned}
f_1(y_i, \Theta) &= M_y^{(2)}(0) - \hat{M}_y^{(2)}(0)\\
f_2(y_i, \Theta) &= M_y^{(3)}(k,l) - \hat{M}_y^{(3)}(k,l)\\
f_3(y_i, \Theta) &= M_y^{(4)}(0,0,0) - \hat{M}_y^{(4)}(0,0,0)
\end{aligned}$$
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Diagonal model EB(k,k) is identified on the base of three moments. The functions f. for 
j=l,...,6 are: 

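$$\begin{aligned}
f_1(y_i, \Theta) &= M_y^{(1)} - \hat{M}_y^{(1)}\\
f_2(y_i, \Theta) &= M_y^{(2)}(0) - \hat{M}_y^{(2)}(0)\\
f_3(y_i, \Theta) &= M_y^{(2)}(k) - \hat{M}_y^{(2)}(k)\\
f_4(y_i, \Theta) &= M_y^{(3)}(0,0) - \hat{M}_y^{(3)}(0,0)\\
f_5(y_i, \Theta) &= M_y^{(3)}(k,k) - \hat{M}_y^{(3)}(k,k)
\end{aligned}$$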

/ 6 (!/„e) = m«-^> 
For elementary bilinear models the vector of parameters $\Theta$ contains two elements: $m_w^{(2)}$ and $b_{kl}$. The parameters are calculated by minimization of the index (41), using e.g. the nonlinear least squares method. It is assumed that the starting point $\Theta_0 = [b_{kl,0}, m_{w,0}^{(2)}]$ is a solution obtained with the simple method of moments. The minimum of the index I may be searched for under the assumption that the parameters $b_{kl}$ and $m_w^{(2)}$ are constrained. The constraints result from the following attributes:

- The variance $m_w^{(2)}$ of the model input should be positive and less than the output variance, hence:

$$0 < m_w^{(2)} < m_y^{(2)}, \tag{43}$$

- The model should be stable, hence:

$$b_{kl}^2\, m_w^{(2)} < 1 \tag{44}$$
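A constrained minimization of the index (41) can be sketched as below. Several simplifications are assumed and should be read as such: only two moment functions are used instead of four, the analytical expressions $M_y^{(2)}(0) = m_w^{(2)} / (1 - b_{kl}^2 m_w^{(2)})$ and $M_y^{(3)}(k,l) = b_{kl}\, m_w^{(2)} M_y^{(2)}(0)$ are taken for the subdiagonal model with k<l, and a simple compass (pattern) search replaces the nonlinear least squares routine. The function names are hypothetical.

```python
def gmm_index(theta, m2_hat, m3_hat):
    """Index (41) built from two moment functions (a simplification)."""
    b, m_w2 = theta
    m2 = m_w2 / (1.0 - b * b * m_w2)       # assumed analytical M2(0)
    m3 = b * m_w2 * m2                     # assumed analytical M3(k,l)
    return (m2 - m2_hat) ** 2 + (m3 - m3_hat) ** 2


def gmm_compass_search(start, m2_hat, m3_hat, step=0.1, tol=1e-6):
    """Minimize the index inside the constraints (43) and (44)."""
    def feasible(b, m_w2):
        return 0.0 < m_w2 < m2_hat and b * b * m_w2 < 1.0

    b, m_w2 = start
    best = gmm_index((b, m_w2), m2_hat, m3_hat)
    while step > tol:
        improved = False
        for db, dm in ((step, 0.0), (-step, 0.0), (0.0, step), (0.0, -step)):
            cb, cm = b + db, m_w2 + dm
            if feasible(cb, cm):
                value = gmm_index((cb, cm), m2_hat, m3_hat)
                if value < best:
                    b, m_w2, best, improved = cb, cm, value, True
        if not improved:
            step /= 2.0                    # refine the mesh
    return b, m_w2


# Moments of an EB(k,l) process with b = 0.3 and unit input variance
m2_true = 1.0 / (1.0 - 0.09)
m3_true = 0.3 * m2_true
b_hat, m_hat = gmm_compass_search((0.25, 0.9), m2_true, m3_true)
```

In practice the starting point would come from the simple method of moments and the targets from sample moments; exact moments are used here only so that the expected answer is known.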

3.3 Examples 

The methods discussed above were applied to elementary bilinear time series identification 
under the following conditions: 

1. Elementary diagonal and subdiagonal time series were identified. 

2. The distribution of the white noise $w_i$ was assumed to be:

• Gaussian,

• uniform,

with zero mean and variance $m_w^{(2)} = 1$.

3. All considered processes were invertible, i.e. the parameters satisfied the following condition: $\beta_{kl}^2\, m_w^{(2)} < 0.5$ (Tong, 1993).

4. Identification was performed for 200 different realizations of the time series, each consisting of 1000 data points.

5. For the generalized moments method:

- Minimization of the performance index was carried out with the constraints:

$$-\frac{0.5}{m_w^{(2)}} < b_{kl} < \frac{0.5}{m_w^{(2)}}, \qquad 0 < m_w^{(2)} < m_y^{(2)}$$

- The starting point was calculated using the simple moments method.

The results of the investigation may be summarized as follows:

1. Not every invertible elementary bilinear process is identifiable.

2. Correct identification results were obtained for processes for which $\beta_{kl} < 0.4$, which is equivalent to $\beta_{kl}^2\, m_w^{(2)} < 0.16$.

3. When $\beta_{kl}^2\, m_w^{(2)} > 0.16$, the number of process realizations for which an elementary bilinear model cannot be identified grows with $\beta_{kl}^2\, m_w^{(2)}$.

4. When $\beta_{kl} < 0.4$, all the tested methods give expected values of the identified parameters $b_{kl}$ equal to the true values $\beta_{kl}$.

5. The generalized moments method is somewhat better than the other considered methods, because the variances of the estimated parameters are the smallest.

6. For processes with Gaussian excitation the variances of the identified parameters are greater than for processes with a uniformly distributed input signal.

4. Application of EB(k,l) in signal modelling and prediction 

Elementary bilinear time series models, whose statistical attributes and identification methods were presented in the previous sections, are suited to modelling only a limited class of signals. However, using EB(k,l) models as a part of a hybrid linear-bilinear model widens the class of signals for which the accuracy of modelling and prediction can be improved.

4.1 Hybrid linear-bilinear model 

The idea of a hybrid linear-bilinear (HLB) model is presented in Fig. 3. An elementary bilinear model EB(k,l), for which it is assumed that $k \le l$ and that $e_i$ is an independent white noise series, is applied as a part of the HLB model. For k<l the HLB model may be considered a linear autoregressive model driven by an EB(k,l) series. The hybrid model consists of two parts:

- linear, built on the original data series $y_i$:

$$y_i^{L} = -\sum_{j=1}^{dA} a_j\, y_{i-j} \tag{45}$$

- nonlinear, built on the residuum $w_i$:

$$w_i = y_i - y_i^{L} \tag{46}$$

The residuum $w_i$ is described in the following way:

$$w_i = \eta_i - v \tag{47}$$

where:

$$v = \begin{cases} \beta_{kk}\, m_e^{(2)} & \text{for the EB(k,k) model}\\ 0 & \text{for the EB(k,l) model} \end{cases} \tag{48}$$

and $\eta_i$ is described by the elementary bilinear model EB(k,l) or EB(k,k):

$$\eta_i = \begin{cases} e_i + \beta_{kk}\, e_{i-k}\, \eta_{i-k} & \text{for the EB(k,k) model}\\ e_i + \beta_{kl}\, e_{i-k}\, \eta_{i-l} & \text{for the EB(k,l) model} \end{cases} \tag{49}$$

[Figure 3: block diagram connecting the blocks Process, Linear Model AR(dA) and Elementary bilinear model into the Hybrid LB model]

Fig. 3. Hybrid Linear-Bilinear model

The output of the HLB model is the following sum:

$$y_i^{LB} = y_i^{L} + \eta_i \tag{50}$$

Identification of the HLB model is done in three stages.

1. The first stage, data pre-processing, is optional. If the original data set $\{x_i\}$ contains linear trends, they are removed according to:

$$z_i = x_i - \bar{x}_i \tag{51}$$

If necessary, the obtained data set $z_i$ may be transformed. One possible data transformation is:

$$y_i = \frac{z_i}{\operatorname{var}(z)} \tag{52}$$

2. In the second stage, the linear model AR(dA) (53) is identified:

$$A(z^{-1})\, y_i = w_i \tag{53}$$

Experience shows that AR(dA) models satisfying the coincidence condition:

$$r_j\, a_j \le 0 \quad \text{for } j = 1, \ldots, dA \tag{54}$$

where:

$$r_j = \frac{1}{N-j}\sum_{i=j+1}^{N} y_i\, y_{i-j} \tag{55}$$

are not only parsimonious but also have better predictive properties than AR(dA) models with the full rank.

3. In the third stage, an elementary bilinear time series model is identified for the residuum $w_i$ in the way discussed in section 3.

4.2 Prediction 

Time series models are mainly applied to signal prediction. In this section, a prediction algorithm derived from the HLB model is presented. As discussed in section 2, the elementary bilinear models EB(k,l) and EB(k,k) have different statistical attributes. Therefore the prediction algorithms, though based on the same HLB model, have to be designed separately for the residuum represented as EB(k,l) and as EB(k,k). The minimum variance prediction algorithms are rooted in the following theorems.
Theorem 1.

If $y_i$ is a non-Gaussian stochastic time series described by the hybrid model HLB: $A(D)\,y_i = \eta_i$, where:

- the residuum $\eta_i$ is represented as a subdiagonal model EB(k,l) with k<l:

$$\eta_i = w_i + \beta_{kl}\, w_{i-k}\, \eta_{i-l},$$

- $w_i$ is an independent white noise series with the variance $m_w^{(2)}$,

then the h-step prediction according to the algorithm:

$$\hat{y}_{i+h|i} = G(D)\,y_i + \beta_{kl}\,F(D)\left(\varepsilon^{\eta}_{i+h-k}\,\eta_{i+h-l}\right) \tag{56}$$

where:

$$\varepsilon^{\eta}_i = \eta_i - \hat{\eta}_{i|i-h},$$

gives the prediction error $\varepsilon^{y}_i = F(D)\,w_i$ with the minimal possible variance:

$$E\left\{\left(\varepsilon^{y}_i\right)^2\right\} = m_w^{(2)}\left(1 + \sum_{j=1}^{h-1} f_j^2\right) \tag{57}$$

In the above equations D stands for a nonlinear delay operator defined as follows:

$$D^k(y_i) = y_{i-k}, \qquad D^k(y_i\, x_i) = y_{i-k}\, x_{i-k}, \qquad D^k(y_i)\, x_i = y_{i-k}\, x_i.$$

A(D), F(D), G(D) are polynomials in D with degrees dA, h-1, dA-1 respectively. The polynomials are related to each other so as to satisfy the following equation:

$$1 = A(D)\,F(D) + D^h\,G(D) \tag{58}$$

When the residuum is a diagonal EB(k,k) process, the following theorem is formulated.

Theorem 2.

If $y_i$ is a non-Gaussian stochastic time series described by the hybrid model HLB: $A(D)\,y_i = z_i$, where the residuum $z_i$ may be presented as:

$$z_i = \eta_i - v, \qquad \eta_i = w_i + \beta_{kk}\, w_{i-k}\, \eta_{i-k},$$

then the h-step prediction according to the algorithm:

$$\hat{y}_{i+h|i} = G(D)\,y_i + \beta_{kk}\,F(D)\left(\varepsilon^{\eta}_{i+h-k}\, z_{i+h-k}\right) + v\,\beta_{kk}\,F(D)\left(\varepsilon^{\eta}_{i+h-k}\right) + F(D)\,v \tag{59}$$

where:

$$\varepsilon^{\eta}_i = \eta_i - \hat{\eta}_{i|i-h},$$

gives the prediction error $\varepsilon^{y}_i = F(D)\,w_i$ with the minimal possible variance:

$$E\left\{\left(\varepsilon^{y}_i\right)^2\right\} = m_w^{(2)}\left(1 + \sum_{j=1}^{h-1} f_j^2\right)$$

The delay operator D and the polynomials A(D), F(D), G(D) are defined in the same way as in Theorem 1.
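The Diophantine equation (58) can be solved by polynomial long division: F(D) collects the first h terms of the expansion of 1/A(D), and G(D) is what remains after factoring out $D^h$. The sketch below assumes that A(D) is monic, i.e. $A(D) = 1 + a_1 D + \ldots + a_{dA} D^{dA}$ (so that $f_0 = 1$), and represents polynomials as coefficient lists in ascending powers of D. The function names are hypothetical.

```python
def solve_diophantine(a, h):
    """Return (f, g) with 1 = A(D) F(D) + D^h G(D).

    a : [1, a_1, ..., a_dA], coefficients of A(D)
    h : prediction horizon (number of terms in F)
    """
    d_a = len(a) - 1
    # Running remainder, initially the polynomial "1"
    rem = [1.0] + [0.0] * (d_a + h - 1)
    f = []
    for j in range(h):
        f_j = rem[j] / a[0]
        f.append(f_j)
        for m, a_m in enumerate(a):
            rem[j + m] -= f_j * a_m        # subtract f_j * D^j * A(D)
    g = rem[h:h + d_a]                     # coefficients of G(D)
    return f, g


def prediction_error_variance(f, m_w2):
    """Variance (57); f[0] = 1 supplies the leading 1 in the bracket."""
    return m_w2 * sum(f_j * f_j for f_j in f)


# A(D) = 1 - 0.8 D, horizon h = 2  ->  F(D) = 1 + 0.8 D, G(D) = 0.64
f, g = solve_diophantine([1.0, -0.8], 2)
```

For this first-order example the identity can be verified by hand: $(1 - 0.8D)(1 + 0.8D) + 0.64 D^2 = 1$.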

4.3 Prediction strategy 

A prediction strategy is the way of processing the original data series so as to obtain an acceptable prediction. In this section the MV-HLB prediction strategy is formulated. The strategy has the form of an algorithm consisting of the following steps:

1. The original data set $y_i$, i=1,...,N, is divided into two sets:

- training set, for $i = 1, \ldots, N_{train}$, which is used for model identification,

- testing set, for $i = 1, \ldots, N_{test}$, on which the prediction algorithm is tested.

2. On the training set, the parameters of a linear AR(dA) model:

$$y_i = -a_1 y_{i-1} - a_2 y_{i-2} - \ldots - a_{dA}\, y_{i-dA} \tag{60}$$

are estimated. Only models that satisfy the coincidence condition (54) are accepted for further consideration.

3. On the training set the residuum is calculated according to the equation:

$$\eta_i = y_i + a_1 y_{i-1} + a_2 y_{i-2} + \ldots + a_{dA}\, y_{i-dA} \tag{61}$$

4. In the following steps 5-8, the identification procedures described in detail in section 3 are carried out.

5. The first, the second, the third and the fourth moments of the residuum $\eta_i$ are estimated.

6. The identifiability criterion for the EB(k,l) process is checked for the series of residuum. If fitting an elementary bilinear model is possible, one continues with step 7. If not, one moves to step 12.

7. The structure (k,l) of the EB(k,l) model is established on the basis of the third moment of the residuum.

8. The values of $\beta_{kl}$ and $m_w^{(2)}$ are calculated using e.g. one of the moment methods.

9. For the assumed prediction horizon h and the estimated polynomial A(D), the Diophantine equation (58) is solved, and the parameters $f_k$, k=1,...,h-1, of the polynomial F(D) as well as the parameters $g_j$, j=1,...,dA-1, of the polynomial G(D) are calculated. Then, if the prediction horizon $h \le \min(k,l)$, the prediction algorithm is designed either on the basis of Theorem 1, for the EB(k,l) model of the residuum, or on the basis of Theorem 2, for the EB(k,k) model of the residuum.

10. The designed prediction algorithm is tested on the testing set. STOP.

11. If $h > \min(k,l)$ then move to step 12.

12. Design a linear prediction algorithm, e.g. [1], [4]: $\hat{y}_{i+h|i} = G(D)\,y_i$

13. Test it on the testing set. STOP.

The above prediction strategy was tested for simulated and real world time series. In the next section, the strategy is applied to the series of sunspot numbers, and the MVB prediction is compared with the nonlinear prediction performed using the benchmark SETAR model proposed by Tong (Tong, 1993).
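Steps 2 and 3 of the strategy, i.e. the linear prediction (60) and the residuum (61), can be sketched as below. The AR coefficients are passed as a dictionary from lag to $a_j$, which is an assumed convenience for sparse models; the function names and the toy series are hypothetical. The example coefficients correspond to a linear part of the form $Y_i = 0.81 Y_{i-1} + 0.21 Y_{i-9}$, so that $a_1 = -0.81$ and $a_9 = -0.21$.

```python
def ar_one_step(y, a, i):
    """Linear prediction (60): minus the sum of a_j * y[i - j]."""
    return -sum(a_j * y[i - j] for j, a_j in a.items())


def ar_residuum(y, a):
    """Residuum (61) for every index with a complete history."""
    d_a = max(a)
    return [y[i] - ar_one_step(y, a, i) for i in range(d_a, len(y))]


a = {1: -0.81, 9: -0.21}
y = [float(v) for v in range(1, 12)]       # toy series 1, 2, ..., 11
eta = ar_residuum(y, a)
```

The residuum series returned here is the input to the moment estimation and structure selection of steps 5-8.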



4.4 Sunspot number prediction 

Sunspot events have been observed and analysed for more than 2000 years.

Fig. 4. Sunspot events

The earliest recorded date of a sunspot event is 10 May 28 BC. The solar cycle was first noted in 1843 by the German pharmaceutical chemist and astronomer Samuel Heinrich Schwabe, as a result of 17 years of daily observations. The nature of the solar cycle, presented in Fig. 4 and characterized by a periodically varying number of sunspots, remains a mystery to date. Consequently, the only feasible way to predict future sunspot numbers is time series modelling and prediction. Linear prediction does not give acceptable results; hence efforts are made to improve the prediction using nonlinear models and nonlinear methods. Tong (Tong, 1993) fitted a threshold autoregressive (SETAR) model to the sunspot numbers of the period 1700-1979:



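$$Y_i = \begin{cases}
1.92 + 0.84Y_{i-1} + 0.07Y_{i-2} - 0.32Y_{i-3} + 0.15Y_{i-4} - 0.20Y_{i-5} - 0.00Y_{i-6} + 0.19Y_{i-7} - 0.27Y_{i-8} - 0.21Y_{i-9} - 0.01Y_{i-10} + e_i^{(1)} & \text{when } Y_{i-8} \le 11.93\\[4pt]
4.27 + 1.44Y_{i-1} - 0.84Y_{i-2} - 0.06Y_{i-3} + e_i^{(2)} & \text{when } Y_{i-8} > 11.93
\end{cases} \tag{62}$$

The real data were transformed in the following way:

$$Y_i = 2\left(\sqrt{1 + y_i} - 1\right) \tag{63}$$

where $y_i$ is the sunspot number in the year 1699+i.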

Based on the model (62), a prediction for the period 1980-2005 was derived and used as a benchmark for comparison with the prediction performed in the way discussed in this chapter. The HLB model (64) was then fitted to the sunspot numbers from the same period 1700-1979, under the assumption that the linear part of the HLB model satisfies the coincidence condition.

$$\begin{aligned}
Y_i &= 0.81Y_{i-1} + 0.21Y_{i-9} + \eta_i\\
\eta_i &= e_i + 0.02\,\eta_{i-7}\,e_{i-7}
\end{aligned} \tag{64}$$

$Y_i$ is a variable transformed in the same way as in Tong's model (62), and the variance of the residuum is $\operatorname{var}(\eta) = 8.13$.



[Figure 5: timeline for the years 1980-1984 showing the real data and the predictions calculated from them]

Fig. 5. Scheme of prediction calculation

Sunspot prediction for the period 1981-2005 was performed according to the scheme shown in Fig. 5. The one step ahead prediction $\hat{y}_{i+1|i}$ calculated at time i depends on the previous data and the previous predictions. The prediction algorithm has the form specified in Theorem 2. For the data transformed according to (63), the predictions based on Tong's model and on the HLB model are compared in Fig. 6.
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Fig. 6. Prediction for the period 1981-2005 based on Tong's and HLB models 

The HLB prediction is evidently more precise than the one based on Tong's model. The sum of squares of Tong's prediction errors was:

S T =1.07x10*, 
while the sum of squares of the HLB prediction errors was:

$$S_{HLB} = 1.70 \times 10^{3}$$

The data transformation (63) is not natural for minimum variance prediction. Therefore, the HLB model was identified once more, for the data transformed in the following way:

$$Y_i = \frac{y_i - \bar{y}}{\operatorname{var}(y)} \tag{65}$$

This time the following HLB model was identified:

$$\begin{aligned}
Y_i &= 0.80Y_{i-1} - 0.29Y_{i-7} + 0.52Y_{i-8}\\
\eta_i &= e_i + 0.08\,\eta_{i-3}\,e_{i-3}
\end{aligned} \tag{66}$$

and the variance of the residuum was $\operatorname{var}(\eta) = 0.24$. The prediction algorithm was built on the basis of model (66) in the way specified in Theorem 2. The sum of squares of the HLB prediction errors was this time:

$$S_{HLB} = 30,$$

hence a higher quality of the HLB prediction was obtained this time than previously. Fig. 7 illustrates the prediction for the period 1981-2005 obtained on the basis of Tong's model (62), built on the data transformed according to (63), and on the basis of the HLB model (66).

Tong (Tong, 1993), after discussion with Sir David Cox, one of the greatest statisticians of the 20th century, defined genuine prediction as the prediction of data that are entirely unknown at the stage when the prediction is established. The idea is illustrated in the following scheme, and is also known as multi-step prediction.

Genuine prediction of sunspot numbers for the years 1980-1983 was established from the data available in 1979, on the basis of Tong's model and the HLB model. The sums of squares of the prediction errors were equal to 347 and 342, respectively. The results are shown in Fig. 9.
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Fig. 7. Prediction for the period 1981-2005 based on Tong's and HLB models. 
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Fig. 9. Genuine prediction for the period 1980-84 



5. Resume 

In this chapter, a new method of time series analysis by means of elementary bilinear time series models was proposed. To this end, a new hybrid linear - elementary bilinear model structure was suggested. The main virtue of the model is that it can be easily identified. Identification should be performed separately for the linear and the nonlinear part of the model. The nonlinear part of the model is applied to the residuum and has an elementary bilinear structure. Model parameters may be estimated using one of the moment methods, because the relations between moments and parameters of elementary bilinear time series models are known.

Based on the HLB model, a minimum-variance bilinear prediction algorithm was proposed and the prediction strategy was defined. The proposed prediction strategy was then applied to one of the best-known benchmarks, sunspot number prediction. The prediction efficiency obtained with the HLB model and the bilinear prediction algorithm, in the way described in this chapter, turned out to be much better than the efficiency obtained with the SETAR model proposed by Tong.
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1. Introduction 



Identification is a process (a measurement experiment and a numerical procedure) aimed at determining a quantitative model of the examined object's behaviour. Identification of dynamics is a process that seeks a quantitative model of the variation of the system state with time. The goal of the experiment is to measure the inputs and outputs of the examined object, i.e. the system excitations and reactions. In special cases the model can be treated as a "black box", but it always has to be connected with physical laws and cannot be inconsistent with them.

The most commonly used models of system dynamics are differential equations: general nonlinear, partial, often nonlinear ordinary, rarely linear ordinary, additionally non-stationary and with deviated arguments. Sometimes one considers discrete-time models in the form of difference equations, which are simplified models of a certain kind. Integral equations, functional equations etc. are models of a different kind.

If the model structure is known a priori or can be assumed, identification consists in determining the model parameters and is called parametric identification. If the full model structure or a part of it is not known, nonparametric identification has to be used.

In the domain of linear models, the equivalent of a linear ordinary differential equation is a transfer function, a transient response or a frequency response. They can be obtained using experiments of various types: passive (observation of inputs and outputs without acting on the object) or active (excitation of the examined object by special signals: deterministic, such as impulse, step, periodic and aperiodic signals, or random, such as white noise, coloured noise and noise with a determined spectrum).

These many possibilities lead to a variety of identification methods, and various methods have been developed in recent decades. The rich bibliography on this topic includes Uhl's work (Uhl, 1997), which describes computer methods of identification of linear and nonlinear system dynamics, in the time domain and in the frequency domain, with short characteristics and the range of their applications.

There are many methods of parametric and nonparametric identification of linear system dynamics (Eykhoff, 1980), (Iserman, 1982), (Soderstrom & Stoica, 1989). Fewer useful methods exist for systems with nonlinear dynamics (Billings & Tsang, 1992), (Greblicki & Pawlak, 1994), (Haber & Keviczky, 1999); therefore the simple solution presented here can be very useful in identifying the structure of a model of nonlinear dynamics of the 1st and higher orders.

The method of identification has to be adapted to the planned and feasible experiments and to the type of the assumed model structure. Active identification is more precise than identification based on a passive experiment. Parametric identification combines optimisation with the regression method, which reduces the influence of disturbances and consequently increases the accuracy of the model parameters. The precision of identification depends on the degree to which disturbances are eliminated, on the errors of the measurement methods and on the accuracy of the measuring devices.

The input and output are often measured as disturbed signals. Parametric identification of nonlinear systems based on the method of averaged differentiation with correction was introduced in (Bockowska, 2003). The method of averaged differentiation makes it possible to filter distorted signals and to obtain their derivatives. Thanks to the correction procedure, one can obtain values of the corrected averaged signals and their derivatives that are very close to their real values.

2. Averaged differentiation method with correction as regularization filter 

The method of averaged differentiation is a regularization filter that makes it possible to determine useful signals and their derivatives from the disturbed signals available from measurement. The operation of averaged differentiation can be used to evaluate a derivative of any function of signal and time, and the averaged signals can be used in estimation of model parameters. Nevertheless, its application alone is not sufficient to determine nonlinearities such as products or other nonlinear functions of derivatives of different orders. The problem can be solved by combining the operation with specially designed correction procedures. One can then obtain values of the corrected averaged signals and their derivatives that are very close to their real values, and so can be used to determine nonlinearities of different structures, well enough to estimate the parameters of nonlinear models.

2.1 Definition of averaged differentiation method 

If the signal x(t) is passing through the window g(v) with the width ±d starting from the 
moment to, Fig. 1, then the output of the window is the signal (Kordylewski & Wach, 1988): 

d 
x(t ) g = Jx(t +v).g(v)dv. (1) 

-d 

If the function x(t) is differentiable, then it x(to+v) can be expanded into Taylor series in the 
neighbourhood of the point to: 



oo l 

(to+v) = X Z r- x(i) ( t o)- (2) 



=o 



Denoting the moments of the measurement window as: 

d 
m i= jV.g(v)dv (3) 

-d 
the window response takes the form:

$$x(t_0)_g = \sum_{i=0}^{\infty} \frac{m_i}{i!}\,x^{(i)}(t_0). \qquad (4)$$

Fig. 1. The measurement window.

In general, we can assume that $x(t) = y^{(n)}(t)$. Using the definition (1) and integrating
by parts n times we get the formula:

$$\int_{-d}^{d} y^{(n)}(t_0+v)\,g(v)\,dv = y^{(n)}(t_0)_g = (-1)^n \int_{-d}^{d} y(t_0+v)\,g^{(n)}(v)\,dv + \sum_{i=0}^{n-1} (-1)^i \left\{ g^{(i)}(d)\,y^{(n-i-1)}(t_0+d) - g^{(i)}(-d)\,y^{(n-i-1)}(t_0-d) \right\}. \qquad (5)$$

If the weight function g(v) satisfies the following conditions:

$$g^{(i)}(-d) = g^{(i)}(d) = 0 \quad \text{for } i = 0, 1, 2, \ldots, n-1, \qquad (6)$$

then Equation (5) can be simplified to the following form:

$$y^{(n)}(t_0)_g = (-1)^n \int_{-d}^{d} y(t_0+v)\,g^{(n)}(v)\,dv. \qquad (7)$$



This shifts the differentiation from the signal y(t), which is usually disturbed, to the
weight function, which is known in analytical form. In general, this considerably lowers
the influence of disturbances on the result of differentiation, and the effect is stronger
the greater the measuring window width 2d (Bockowska, 1998). The operation of averaged
differentiation can therefore be treated as equivalent to passing the signal through the
measuring window described by the equation:

$$g_n(t) = (-1)^n \cdot g^{(n)}(t). \qquad (8)$$

The symmetry and normalisation of the function g(v) are not necessary, but satisfying these
conditions is useful for further considerations. Since the function g(v) is an even function,

$$g(t) = g(-t) \quad \text{for } t \in \langle -d, d \rangle, \qquad (9)$$

the odd moments are always equal to zero, $m_{2i+1} = 0$, and there is no phase shift between
the signals $x(t_0)$ and $x(t_0)_g$. If the normalising condition is satisfied:

$$\int_{-d}^{d} g(v)\,dv = 1, \qquad (10)$$

then the transformation (1) corresponds to the averaging of the signal x(t) with the weight
function g(v) in the time interval $\langle t_0 - d,\ t_0 + d \rangle$:

$$x(t_0)_g = \frac{\int_{-d}^{d} x(t_0+v)\,g(v)\,dv}{\int_{-d}^{d} g(v)\,dv}. \qquad (11)$$
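
As a numerical illustration of Equations (1) and (7), the following sketch estimates the first derivative of a sine by moving the differentiation onto the window. It is only a sketch under stated assumptions: it uses the cosine-power window that the chapter introduces later in Equation (20) with r = 6 and k(6) = 8/5, and the function names, the Simpson integration rule and the values t0 = 0.7, d = 0.2 are choices made here for illustration.

```python
import math

def window_derivative(v, d, r=6, k=8.0 / 5.0):
    """First derivative g'(v) of the assumed window g(v) = (k/d) * cos(pi*v/(2d))**r."""
    u = math.pi * v / (2.0 * d)
    return -(k / d) * r * math.cos(u) ** (r - 1) * math.sin(u) * math.pi / (2.0 * d)

def averaged_derivative(y, t0, d, steps=2000):
    """Eq. (7) with n = 1: y'(t0)_g = -integral of y(t0+v) * g'(v) dv over [-d, d].

    The integral is evaluated with the composite Simpson rule (steps must be even).
    """
    h = 2.0 * d / steps
    total = 0.0
    for i in range(steps + 1):
        v = -d + i * h
        weight = 1 if i in (0, steps) else (4 if i % 2 else 2)
        total += weight * y(t0 + v) * window_derivative(v, d)
    return -total * h / 3.0

t0, d = 0.7, 0.2
estimate = averaged_derivative(math.sin, t0, d)   # averaged derivative of sin at t0
exact = math.cos(t0)                              # true derivative
print(estimate, exact)
```

The signal itself is never differentiated; only the analytically known window is. The small remaining difference between the two printed numbers is the averaging error discussed in Section 2.3, and it shrinks as d is reduced.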



2.2 Optimization of realization of averaged differentiation (Bockowska, 2005) 

The operation (1) as well as (7) can be interpreted as a convolution carried out in the
continuous time domain, called the convolution integral. Its value at an arbitrary time t
is found by integrating the signal weighted by the window g(v) or its derivative over the
averaging interval (-d, d). It can be implemented using one of the numerical integration
methods with a fixed step equal to the measuring step $T_s$ or its multiple. Each method
introduces its own errors. The smaller the step size, the more accurate the results.
For any given step size, the more computationally complex the method, the more accurate
but also the longer the calculation.

The problem is much easier to handle with discrete techniques. The convolution integral
can be replaced by the convolution sum:

$$y^{(n)}(l)_g = \sum_{j=0}^{M} h(j)\,y_z(l-j), \quad \text{where } h(j) = (-1)^n \cdot \frac{g^{(n)}(j)}{T_s^{\,n}}, \qquad (12)$$

and M and l are, respectively, 2d and $t_0$ divided by the measuring step $T_s$. If $y_z$ is the
measured signal with N samples, the result of the convolution (algebraically the same
operation as multiplying polynomials whose coefficients are the elements of the signal and
window vectors) is an N+M-1 point signal, in which the first and last M points may not be
usable. The time needed to complete the convolution in the time domain is directly
proportional to the width of the window.

FFT convolution uses the overlap-add method together with the FFT procedure and the
principle that multiplication in the frequency domain corresponds to convolution in the
time domain and vice versa. This operation can be written as:

$$y^{(n)}_g = \mathrm{ifft}\big(\mathrm{fft}(h, M) \cdot \mathrm{fft}(y_z, M)\big). \qquad (13)$$

It is faster than standard convolution for windows with an averaging interval longer
than 64 points, because its time increases very slowly, only as the logarithm of the window
width M. It produces the same result as the corresponding convolution in the time
domain and can be more precise, because the round-off error depends on the total number
of calculations, which is directly proportional to the computation time. FFT convolution,
also called high-speed convolution, is therefore the best way to carry out the operation of
averaged differentiation.
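
The equivalence between the convolution sum (12) and the frequency-domain product (13) can be checked with a small sketch. The assumptions are loud ones: the FFT below is a plain recursive radix-2 routine written for this example, the signals are zero-padded to a single power-of-two block (so this is not the overlap-add scheme mentioned above), and the three-point kernel `h` is an arbitrary smoothing kernel rather than a window from this chapter.

```python
import cmath

def fft(a, inverse=False):
    """Recursive radix-2 FFT (unnormalised); len(a) must be a power of two."""
    n = len(a)
    if n == 1:
        return list(a)
    even = fft(a[0::2], inverse)
    odd = fft(a[1::2], inverse)
    sign = 1.0 if inverse else -1.0
    out = [0j] * n
    for k in range(n // 2):
        twiddle = cmath.exp(sign * 2j * cmath.pi * k / n) * odd[k]
        out[k] = even[k] + twiddle
        out[k + n // 2] = even[k] - twiddle
    return out

def direct_convolution(h, y):
    """Convolution sum in the time domain; cost grows with len(h) * len(y)."""
    out = [0.0] * (len(h) + len(y) - 1)
    for i, hi in enumerate(h):
        for j, yj in enumerate(y):
            out[i + j] += hi * yj
    return out

def fft_convolution(h, y):
    """Same result via ifft(fft(h) * fft(y)) on zero-padded sequences."""
    n_out = len(h) + len(y) - 1
    size = 1
    while size < n_out:
        size *= 2
    spec_h = fft([complex(v) for v in h] + [0j] * (size - len(h)))
    spec_y = fft([complex(v) for v in y] + [0j] * (size - len(y)))
    product = [a * b for a, b in zip(spec_h, spec_y)]
    return [v.real / size for v in fft(product, inverse=True)[:n_out]]

h = [0.25, 0.5, 0.25]            # hypothetical smoothing kernel
y = [1.0, 2.0, 3.0, 4.0, 5.0]    # hypothetical measured samples
print(direct_convolution(h, y))
print(fft_convolution(h, y))
```

Both calls return an N+M-1 point result (here 7 points) that agrees up to round-off, which is the property Equation (13) relies on.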

2.3 Form of measurement window and degree of noise attenuation 

As a weight function g(v) one can use any even function of the form:

$$g(v) = k(n) \cdot f_1(v) \cdot [f_2(v)]^n \qquad (14)$$

if the function $f_2(v)$ is even and satisfies the condition $f_2(\pm d) = 0$, and k(n) is the
normalising coefficient. Taking the Fourier transform of the window g(v) one gets the window
spectrum $G(j\omega)$:

$$G(j\omega) = \int_{-d}^{d} g(v)\,\exp(-j\omega v)\,dv. \qquad (15)$$

Since the function g(v) is even, its Fourier spectrum contains no imaginary part and can be
written in the form:

$$G(j\omega) = \mathrm{Re}\,G(j\omega) = 2 \int_{0}^{d} g(v)\,\cos(\omega v)\,dv. \qquad (16)$$

This means that the operation of averaging does not introduce any phase shift.
If the log magnitude function

$$G_L(\omega) = 20 \log_{10} \left| \int_{-d}^{d} g(v)\,\exp(-j\omega v)\,dv \right| \qquad (17)$$

is assigned to the weight function g(v), then the n-th derivative $g^{(n)}(v)$ corresponds to:

$$G_L(\omega)_n = 20 \log_{10} \left| (j\omega)^n \int_{-d}^{d} g(v)\,\exp(-j\omega v)\,dv \right|. \qquad (18)$$
The normalised frequency is introduced for generality:

$$\Omega = \omega \cdot \frac{2d}{\pi}. \qquad (19)$$

Fig. 2. The logarithmic characteristics of the derivative spectrum of the Nuttall window.

The exemplary Bode plots $G_L(\Omega)_n$ in the logarithmic scale for the weight function:

$$g(v) = \frac{k(r)}{d} \cos^{r}\!\left( \frac{\pi v}{2d} \right), \qquad (20)$$

which corresponds to the Nuttall window (Nuttall, 1981) with r=6 and the normalising
coefficient k(6)=8/5, are shown in Fig. 2. The plots consist of several lobes, and because of
the normalisation (10) we have G(0)=1 and $G_L(0)=0$. Let $\Omega_0$ denote the width of the first
lobe and V the averaged rate of decrease of $G_L$ beyond the first lobe. The signal $x(t_0)_g$
is reconstructed correctly in the frequency band covering the initial part of the main lobe
up to $\Omega_0$, while the disturbances are attenuated in the frequency band of the secondary
lobes.
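
The normalisation (10) and the moment definition (3) can be checked numerically for the window (20). The sketch below is an illustration only: the helper names and the Simpson rule are assumptions of this example, and the window is evaluated for r = 6 with k(6) = 8/5 as quoted above.

```python
import math

def nuttall_window(v, d, r=6, k=8.0 / 5.0):
    """Window of Eq. (20): g(v) = (k/d) * cos(pi*v/(2d))**r on [-d, d]."""
    return (k / d) * math.cos(math.pi * v / (2.0 * d)) ** r

def moment(i, d, steps=2000):
    """Eq. (3): m_i = integral of v**i * g(v) dv over [-d, d] (composite Simpson rule)."""
    h = 2.0 * d / steps
    total = 0.0
    for j in range(steps + 1):
        v = -d + j * h
        weight = 1 if j in (0, steps) else (4 if j % 2 else 2)
        total += weight * v ** i * nuttall_window(v, d)
    return total * h / 3.0

m0 = moment(0, 1.0)        # normalisation (10): should be close to 1
m1 = moment(1, 1.0)        # odd moment: should vanish for an even window
m2 = moment(2, 1.0)        # second moment for d = 1
m2_half = moment(2, 0.5)   # second moment for d = 0.5
print(m0, m1, m2, m2_half / m2)
```

The printed ratio shows how the second moment scales with the square of the window half-width, which is the dependence on d that the averaging error inherits.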

If the signal $y(t_0)$ has the spectrum $Y(j\Omega)$, then the spectrum of the averaged signal
$y(t_0)_g$ has the following form:

$$Y(j\Omega)_g = Y(j\Omega) \cdot G(j\Omega). \qquad (21)$$

The error caused by the process of averaging can be evaluated as the deviation of $G(j\Omega)$
from 1. If $Y(j\Omega)$ and $Z(j\Omega)$ denote the spectra of the signal y(t) and the disturbances
z(t), respectively, then the operation of averaged differentiation of the disturbed signal
$y_z(t) = y(t) + z(t)$ is expressed by the equation:

$$y_z^{(n)}(t_0)_g = (-1)^n \int_{-d}^{d} y(t_0+v)\,g^{(n)}(v)\,dv + (-1)^n \int_{-d}^{d} z(t_0+v)\,g^{(n)}(v)\,dv, \qquad (22)$$
which illustrates how the effect of disturbances is reduced with an adequately selected
window g(v). It corresponds to the spectrum:

$$Y_z(j\Omega) = (j\Omega)^n \cdot Y(j\Omega) \cdot G(j\Omega) + (j\Omega)^n \cdot Z(j\Omega) \cdot G(j\Omega). \qquad (23)$$

If the function $y^{(n)}(t)$ is continuous and differentiable, then we can use its expansion in
a Taylor series, taking into consideration the conditions (6) and (9). Hence we obtain:

$$y_z^{(n)}(t_0)_g = y^{(n)}(t_0) + \sum_{i=1}^{\infty} \frac{m_{2i}}{(2i)!}\,y^{(n+2i)}(t_0) + (-1)^n \int_{-d}^{d} z(t_0+v)\,g^{(n)}(v)\,dv. \qquad (24)$$

This proves that the deviation of the signal $y_z^{(n)}(t_0)_g$ from the derivative $y^{(n)}(t_0)$
has two components: the first is caused by averaging and the second by the presence of
disturbances:

$$y_z^{(n)}(t_0)_g - y^{(n)}(t_0) = \Delta_1 + \Delta_2. \qquad (25)$$

The first error $\Delta_1$ depends on the smoothness of the signal $y^{(n)}(t_0)$ and on the values
of the even moments of the weight function, which for the Nuttall window are given by:

$$m_{2j} = d^{2j} \cdot \Theta(2j, r), \qquad (26)$$

where $\Theta$ are constant coefficients for the function g(v) in the given form. It follows
that the error $\Delta_1$ can be diminished by decreasing d or by a suitable choice of the form
of the weight function g(v), which decreases the coefficients $\Theta$. Because the inequality

$$\int_{-d}^{d} \{z(t_0+v)\}^2 \cdot \{g^{(n)}(v)\}^2\,dv \le \{z(t_0+v)\}^2_{\max} \cdot \int_{-d}^{d} \{g^{(n)}(v)\}^2\,dv \qquad (27)$$

holds, the influence of disturbances can be evaluated by the following measure:

$$M_n = \int_{-d}^{d} \{g^{(n)}(v)\}^2\,dv = \frac{\Psi}{d^{2n+1}}, \qquad (28)$$

where $\Psi$ is the characteristic coefficient of the weight function g(v), which grows very
quickly as the derivative order n increases. This error can be reduced by increasing d and
by a suitable choice of the form of the weight function g(v). It follows that the greater d,
the greater $\Delta_1$ and, simultaneously, the smaller $\Delta_2$. The averaging interval and the
form of the weight function g(v) should be selected so as to minimise both of these errors,
which means choosing the weight function g(v) so that the coefficients $\Theta$ and $\Psi$ are as
small as possible.

2.4 Designing optimal correction procedure (Bockowska, 2006) 

The form of the known weight function can be corrected through the application of special
procedures, which can correspond to the following operation:

$$y(t_0)_k = C_0 \cdot y(t_0)_g + \frac{1}{2} \sum_{i=1}^{n} C_i \cdot \left[ y(t_0+T_i)_g + y(t_0-T_i)_g \right]. \qquad (29)$$

Expanding the signals $y_g$ in the Taylor series we obtain the relation:

$$\begin{aligned}
y(t_0)_k ={}& y(t_0) \cdot (C_0 + C_1 + C_2 + \ldots) \\
&+ y^{(2)}(t_0) \cdot \{ m_2 (C_0 + C_1 + C_2 + \ldots) + C_1 T_1^2 + C_2 T_2^2 + \ldots \} / 2! \\
&+ y^{(4)}(t_0) \cdot \{ m_4 (C_0 + C_1 + \ldots) + 4!\, m_2 (C_1 T_1^2 + \ldots) / (2!\,2!) + C_1 T_1^4 + C_2 T_2^4 + \ldots \} / 4! \\
&+ y^{(6)}(t_0) \cdot \{ m_6 (C_0 + C_1 + C_2 + \ldots) + 6!\, m_4 (C_1 T_1^2 + C_2 T_2^2 + \ldots) / (2!\,4!) \\
&\qquad + 6!\, m_2 (C_1 T_1^4 + C_2 T_2^4 + \ldots) / (4!\,2!) + C_1 T_1^6 + C_2 T_2^6 + \ldots \} / 6! + \ldots
\end{aligned} \qquad (30)$$

Because $y(t_0)_k = y(t_0)$ is required, the corrector parameters $C_i$ and $T_i$ should be evaluated
on the basis of the following system of non-linear equations, in which $M_{2j} = m_{2j}/(2j)!$:

$$\begin{aligned}
C_0 + C_1 + \ldots + C_n &= 1 \\
C_1 T_1^2 + C_2 T_2^2 + \ldots + C_n T_n^2 &= -2!\, M_2 = A_2 \\
C_1 T_1^4 + C_2 T_2^4 + \ldots + C_n T_n^4 &= 4!\, (M_2^2 - M_4) = A_4 \\
C_1 T_1^6 + C_2 T_2^6 + \ldots + C_n T_n^6 &= -6!\, (M_2^3 - 2 M_2 M_4 + M_6) = A_6 \\
C_1 T_1^8 + C_2 T_2^8 + \ldots + C_n T_n^8 &= 8!\, (M_2^4 + 2 M_2 M_6 + M_4^2 - 3 M_2^2 M_4 - M_8) = A_8 \\
C_1 T_1^{10} + C_2 T_2^{10} + \ldots + C_n T_n^{10} &= -10!\, (M_2^5 - 2 M_2 M_8 - 2 M_4 M_6 + 3 M_2^2 M_6 + 3 M_2 M_4^2 - 4 M_2^3 M_4 + M_{10}) = A_{10}
\end{aligned} \qquad (31)$$

The system of Equations (31) is non-linear in the unknowns $C_i$, $T_i$, and it is not clear that
it has real solutions for arbitrary n and a given window.
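
The right-hand sides of the system (31) depend only on the window moments, so they can be computed before any corrector parameter is chosen. The sketch below is a hedged illustration: it assumes the scaled moments $M_{2j} = m_{2j}/(2j)!$, obtains the targets $A_2, A_4, \ldots$ as the coefficients of the reciprocal of the series $1 + M_2 s + M_4 s^2 + \ldots$ (an equivalent way of writing the conditions above), and evaluates them for the cosine window of Equation (20) with r = 1 and d = 1. All function names are hypothetical.

```python
import math

def cosine_power(v, d, r):
    """Unnormalised window shape cos(pi*v/(2d))**r; normalisation is applied numerically."""
    return math.cos(math.pi * v / (2.0 * d)) ** r

def simpson(f, a, b, steps=4000):
    """Composite Simpson rule on [a, b]; steps must be even."""
    h = (b - a) / steps
    total = f(a) + f(b)
    for j in range(1, steps):
        total += (4 if j % 2 else 2) * f(a + j * h)
    return total * h / 3.0

def scaled_moments(r, d=1.0, count=5):
    """Return [M_0, M_2, M_4, ...] with M_2j = m_2j / (2j)! and M_0 = 1."""
    norm = simpson(lambda v: cosine_power(v, d, r), -d, d)
    return [
        simpson(lambda v, p=2 * j: v ** p * cosine_power(v, d, r), -d, d)
        / norm / math.factorial(2 * j)
        for j in range(count)
    ]

def corrector_targets(M):
    """A_2, A_4, ...: (2p)! times the p-th coefficient of 1 / (1 + M_2 s + M_4 s^2 + ...)."""
    K = [1.0]
    for p in range(1, len(M)):
        K.append(-sum(M[j] * K[p - j] for j in range(1, p + 1)))
    return [math.factorial(2 * p) * K[p] for p in range(1, len(M))]

A2, A4, A6, A8 = corrector_targets(scaled_moments(r=1))
print(A2, A4, A6, A8)
```

The alternating signs of the printed values reflect the alternating signs on the right-hand sides of (31).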

In the identification of the system dynamics we want to obtain values of the corrected
averaged signal and its derivatives that are as close as possible to their real values. The
corrected signals should be suitable for forming, for example, the following nonlinearities:

$$y^{(n)}(t) \cdot y^{(k)}(t) \cong y_k^{(n)}(t) \cdot y_k^{(k)}(t), \qquad (32)$$

where $y_k^{(n)}(t)$ and $y_k^{(k)}(t)$ are determined on the basis of the measured signal y(t). To meet
this goal, as many equations of the system (31) as possible must be fulfilled. The first
consecutive equations make it possible to obtain the unknowns $C_i$ (Bockowska, 2003). The time
constants $T_i$ must be evaluated by minimising the residuals of the remaining equations for a
given form of the window, measuring step and width d of the averaging interval. The time
constants $T_i$ can be expressed in terms of the width of the averaging interval d as:

$$T_i = T_{di} \cdot d, \qquad (33)$$

so $T_{di}$ and the corresponding $C_i$ are the corrector parameters. The correction procedure must
be determined separately for each kind of window and for its chosen order and width. The
course of the correction will be considered for the cosine window known as the Max Rolloff
4-Term or Nuttall window of 6th order and the correction procedure of 2nd order in the form:
$$y_k(t_0) = C_0 \cdot y(t_0)_g + \frac{1}{2} \sum_{i=1}^{2} C_i \cdot \left[ y(t_0+T_i)_g + y(t_0-T_i)_g \right]. \qquad (34)$$

The corrector parameters $C_i$ are determined by the following relations:

$$C_1 = \frac{A_2 T_2^2 - A_4}{T_1^2 T_2^2 - T_1^4}, \qquad C_2 = \frac{A_4 - A_2 T_1^2}{T_2^4 - T_1^2 T_2^2}, \qquad C_0 = 1 - C_1 - C_2, \qquad (35)$$

which follow from the first three equations of the system (31). The determination of the time
constants $T_i$ requires optimisation. After the introduction of two auxiliary variables:

$$x = T_1^2 \quad \text{and} \quad y = T_2^2, \qquad (36)$$

the next equation can be written as a relation which is linear and symmetrical with respect
to x and y:

$$f(x,y) = y A_4 - y x A_2 + x A_4 - A_6 = 0. \qquad (37)$$

As shown in Figure 3, Equation (37) is never satisfied for positive x and y and has to
be omitted. The fifth equation of the system (31) is a quadratic equation with two roots:

$$g(x,y) = y^2 A_4 - y^2 x A_2 + y x A_4 - y x^2 A_2 + x^2 A_4 - A_8 = 0. \qquad (38)$$

Only the positive root is taken into consideration. It can be written as the function:

$$x(y) = \frac{-y^2 A_2 + y A_4 - \left( y^4 A_2^2 + 2 y^3 A_2 A_4 - 3 y^2 A_4^2 - 4 y A_2 A_8 + 4 A_4 A_8 \right)^{1/2}}{2\,(y A_2 - A_4)}, \qquad (39)$$

which gives the admissible range of variability of y and x, because their changes are
symmetrical.

Fig. 3. Relation f(x,y) defined by Equation (37).

Fig. 4. Relation g(x,y) defined by Equation (38).

An additional condition is needed to obtain both x and y:

$$|C_1| + |C_2| + |C_0| = \min, \qquad (40)$$

which follows from the postulate of a minimal increase in the contribution of disturbances.
Taking into consideration (35) and (39), it can be written as a function of y:

$$h(y) = \left| \frac{y A_2 - A_4}{x(y)\,(y - x(y))} \right| + \left| \frac{x(y) A_2 - A_4}{y\,(y - x(y))} \right| + \left| \frac{x(y) A_2 - A_4 + y A_2 - x(y)\,y}{x(y)\,y} \right|. \qquad (41)$$

Fig. 5. Variability of the auxiliary variable x and function h(y).

The above derivation is valid for a corrector of 2nd order for any window, but the presented
graphs correspond to the Nuttall window of 6th order. The determined values of the
coefficients $A_2$, $A_4$, $A_6$, $A_8$ and the corrector parameters $C_0$, $C_1$, $C_2$ and $T_{d1}$, $T_{d2}$ for
the Nuttall window of 1st to 6th order are shown in Table 1. The presented values are
functions of the window width d.
r    A_2      A_4     A_6      A_8     C_0     C_1     C_2      T_d1    T_d2
1    -0.189   0.137   -0.207   0.547   1.773   0.21    -0.983   1.139   0.686
2    -0.131   0.061   -0.058   0.091   1.878   0.251   -1.129   0.89    0.54
3    -0.099   0.034   -0.023   0.025   1.977   0.265   -1.242   0.75    0.447
4    -0.08    0.022   -0.011   0.009   2.013   0.316   -1.329   0.648   0.4
5    -0.067   0.015   -0.006   0.004   2.108   0.29    -1.398   0.589   0.346
6    -0.057   0.011   -0.004   0.002   2.119   0.330   -1.449   0.533   0.323

Table 1. The coefficients A and corrector parameters C and T for the Nuttall window of 1st to
6th order.
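
The design steps (35) to (41) can be traced numerically for one row of Table 1. The sketch below is an illustration under stated assumptions: it takes the rounded values $A_2$, $A_4$, $A_8$ of the row r = 1, reads the criterion h(y) as the sum of the absolute values of $C_0$, $C_1$, $C_2$, and searches a simple grid of y values below the point where x and y coincide. The function names and the grid limits are choices of this example.

```python
import math

A2, A4, A8 = -0.189, 0.137, 0.547   # Table 1, row r = 1 (rounded values)

def x_of_y(y):
    """Positive root of the quadratic (38), written as in Eq. (39)."""
    disc = (y**4 * A2**2 + 2 * y**3 * A2 * A4 - 3 * y**2 * A4**2
            - 4 * y * A2 * A8 + 4 * A4 * A8)
    return (-y**2 * A2 + y * A4 - math.sqrt(disc)) / (2 * (y * A2 - A4))

def coefficients(x, y):
    """C_0, C_1, C_2 from Eq. (35) with x = T_1**2 and y = T_2**2."""
    c1 = (A2 * y - A4) / (x * y - x**2)
    c2 = (A4 - A2 * x) / (y**2 - x * y)
    return 1.0 - c1 - c2, c1, c2

def h(y):
    """Assumed criterion (41): sum of absolute values of the corrector coefficients."""
    return sum(abs(c) for c in coefficients(x_of_y(y), y))

grid = [0.10 + 0.001 * i for i in range(701)]    # y from 0.10 to 0.80
y_best = min(grid, key=h)
x_best = x_of_y(y_best)
c0, c1, c2 = coefficients(x_best, y_best)
t_d1, t_d2 = math.sqrt(x_best), math.sqrt(y_best)
print(t_d1, t_d2, c0, c1, c2)
```

The printed time constants and coefficients can be compared with the first row of Table 1; small differences are expected because the inputs are rounded to three decimals and the grid is coarse.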

Figure 5 shows the variability of the auxiliary variables x, y and thereby of the time
constants $T_1$ and $T_2$. The function h(y) has its minimum at y ≈ 0.29 or 0.10, and
consequently $T_2$ ≈ 0.53 or 0.32. The time constant $T_1$ obtained from Equations (41) and (36)
is $T_1$ = 0.32 or 0.53. Because the product of a time constant and the width d of the averaging
interval expressed in samples must be an integer, the time constants have to be recalculated
using the following relation:

$$T_{di} = \frac{\mathrm{round}(T_{di} \cdot d)}{d}. \qquad (42)$$



The correction procedure of 2nd order corresponds to the change of the basic window
spectrum to the following form:

$$G_z(\omega) = G(\omega) \cdot G_k(\omega) = G(\omega) \left[ C_0 + \sum_{i=1}^{2} C_i \cos(\omega T_i) \right]. \qquad (43)$$

Introducing the auxiliary variable:

$$u = \frac{\pi v}{2d}, \qquad (44)$$

the definition (15) can be rewritten in the following form:

$$G(\Omega) = \frac{4d}{\pi} \int_{0}^{\pi/2} g(u) \cos(\Omega u)\,du, \qquad (45)$$

and the spectrum of the corrected window assumes the form:

$$G_z(\Omega) = \left[ C_0 + \sum_{i=1}^{2} C_i \cos(\Omega T_{ui}) \right] \cdot \frac{4d}{\pi} \int_{0}^{\pi/2} g(u) \cos(\Omega u)\,du, \qquad (46)$$

where $T_{ui} = \pi T_i / (2d)$ is the time constant $T_i$ rescaled in the same way as v in (44).
The relation (46) makes it possible to obtain the maximum normalised frequency $\Omega_a$ which
can be transferred with the assumed accuracy a through the corrected window, by solving the
following equation:

$$a = 1 - G_z(\Omega_a). \qquad (47)$$

The corrector parameters $C_i$ and $T_i$ should be evaluated on the basis of two criteria: the
minimisation of the deviation of $G(j\Omega)$ from 1 in the pass band and of the magnitude
$\Omega^n G(j\Omega)$ in the stop band. Applying the correction procedures is an attempt to bring the
window closer to the ideal window with brick-wall spectral properties.

The correct evaluation of the pass band frequency $f_p$ is the next important problem. It
decides whether the measured signals are processed correctly and has a big influence on the
quality of the identification. The pass band frequency is a filter feature: for a low-pass
filter it is the maximum frequency which, as a component of the useful signal, is transferred
without deformation, that is, the upper limit of the transfer band. It is the basis for
determining the filter parameters and for a proper choice of the corrector parameters, and
it decides whether valuable information in the signal is lost or the processed signal is
deformed by noise. It therefore determines the credibility of the processed signals as
material for the identification of system dynamics. The pass band frequency $f_p$ can be
determined on the basis of the output spectrum, which is more complex than the input
spectrum because of the influence of the system's dynamics.





Fig. 6. Accuracy a of transmission of frequencies lower than $\Omega_a$ by the corrected window
with spectrum $G_z(\Omega)$.

If a measurement experiment is carried out on the same examined system using the same
instruments but inputs with different parameters, then one can use the power spectral
densities of the outputs obtained for the two extreme inputs. The power spectral density is
suggested in order to increase the spectrum resolution and is defined as follows:

$$GWM = Y \cdot \mathrm{conj}(Y), \qquad (48)$$

and the frequency axis is defined in Hz according to f = (0:N-1)/(N·T_s), where N is the
number of samples and $T_s$ is the time step between consecutive samples. The frequencies of
the dominant peaks can be determined through an analysis of the graph of the difference
between the power spectral densities of the two outputs:

$$\Delta G = GWM_1 - GWM_2. \qquad (49)$$

If $\Omega_a$ and $f_p$ are known, then the required width of the measurement window can be
calculated according to the following relation:

$$d_{opt} = \frac{\Omega_a}{4 \cdot f_p}. \qquad (51)$$



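The determination of $\Omega_a$ can be done very easily and quickly if the analytical relation
$\Omega_a = f(a)$, valid for small a, for example 0.01 < a < 0.1, is known for a given kind of
measurement window. An example of such a relation for a given window is shown below.
Allowing for the correction procedure of 2nd order defined by (34), the spectrum of the
corrected window of order r takes the form:

$$G(\Omega) = \left[ \frac{2}{\pi \Omega} \sin\!\left( \frac{\pi \Omega}{2} \right) \right]^{1 - \mathrm{mod}(r,2)} \cdot \left[ \cos\!\left( \frac{\pi \Omega}{2} \right) \right]^{\mathrm{mod}(r,2)} \cdot \left[ C_0 + \sum_{i=1}^{2} C_i \cos(\Omega T_{ui}) \right] \cdot \prod_{n=1}^{r} \left[ \frac{n^2}{n^2 - \Omega^2} \right]^{1 - |\mathrm{mod}(r,2) - \mathrm{mod}(n,2)|}, \qquad (52)$$

and the corresponding plots for r = 1 to 6 are shown in Figure 7.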

Figure 8 presents the variability of $\Omega_a$ versus a in the range 0.01 < a < 0.1. The
relationship $\Omega_a = f(a)$ in an analytical form is needed to calculate quickly the optimal
width of the measurement window d for a given output and its pass band frequency $f_p$
according to (51). The approximation of this relation can be evaluated using (51) and (47).
The courses in the given range were approximated by a power function:

$$\Omega_{a,apr} = S \cdot a^{P}, \qquad (53)$$

with a very good fit using the regression method. The results, that is the values of the
coefficients S and P as well as the correlation coefficients R, are shown in Table 2. The
mean value of the exponent, $P_m$ = 0.197, was accepted as constant. The new values of the
coefficients $S_m$ for $P_m$ are also presented in Table 2. A quadratic dependence between $S_m$
and r was found, as shown in Figure 9.

Fig. 7. The spectrum of the corrected Nuttall window of 1st to 6th order versus $\Omega$.

Fig. 8. The normalised frequency $\Omega_a$ versus the accuracy a of transmission of the useful
signal.



r      6        5        4        3        2        1
S      4.9755   4.5838   4.1065   3.6324   3.0745   2.4184
P      0.1991   0.2006   0.1971   0.1982   0.1961   0.1882
R      0.9997   0.9997   0.9996   0.9997   0.9998   0.9995
S_m    4.941    4.529    4.105    3.618    3.083    2.49

Table 2. Parameters of the approximating power function for each order r of the
measurement window.



Fig. 9. Variability of $S_m$ versus r (fitted curve: $S_m = -0.024 r^2 + 0.66 r + 1.86$,
$R^2 = 0.9939$).

The resulting approximate relation, which determines $\Omega_a$ as a function of r and a, is as
follows:

$$\Omega_{a,apr} = (-0.024 \cdot r^2 + 0.66 \cdot r + 1.86) \cdot a^{0.197}. \qquad (54)$$

The correlation coefficients between the data obtained on the basis of the corrected window
spectrum and the data calculated on the basis of the above-mentioned Equation (53) are still
close to unity. The needed width of the measurement window can be expressed in measurement
steps (samples), using (54) and the measurement step:

$$d_{opt} = \frac{(-0.024 \cdot r^2 + 0.66 \cdot r + 1.86) \cdot a^{0.197}}{4 \cdot f_p \cdot step}. \qquad (55)$$
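
Relations (51), (54) and (55) reduce the choice of the window width to a few arithmetic operations. A minimal sketch follows; the function names and the numerical example (r = 6, a = 0.05, $f_p$ = 10 Hz, step = 1 ms) are hypothetical, and the regression coefficients are the ones quoted in (54).

```python
def omega_a_approx(r, a):
    """Eq. (54): approximate normalised pass-band limit for window order r and accuracy a."""
    return (-0.024 * r**2 + 0.66 * r + 1.86) * a**0.197

def optimal_half_width(r, a, f_p, step=None):
    """Eq. (51) gives d_opt in seconds; dividing by the sampling step, as in Eq. (55),
    expresses the same width as a number of measurement steps."""
    d_seconds = omega_a_approx(r, a) / (4.0 * f_p)
    return d_seconds if step is None else d_seconds / step

d_sec = optimal_half_width(r=6, a=0.05, f_p=10.0)                  # seconds
d_samples = optimal_half_width(r=6, a=0.05, f_p=10.0, step=0.001)  # samples
print(d_sec, d_samples)
```

In practice the width in samples would still be rounded to an integer, in the same spirit as the rounding of the time constants in (42).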



3. Identification of non-linear system dynamics 

3.1 Parametric identification 

The proposed procedure, the averaged differentiation operation with correction, can be
applied to the evaluation of the parameters of a model of the system dynamics if the model
structure is known and corresponds in general to differential equations of the kind:

$$y^{(n)}(t) = F\big(y, y^{(1)}, \ldots, y^{(n-1)}, x, x^{(1)}, \ldots, x^{(n)}, A_0, \ldots, B_0, \ldots, D\big), \qquad (56)$$

which can be written as the following equation:

$$y^{(n)}(t) = \sum_{j=0}^{n} A_j \cdot F_{jx}\big(y, \ldots, y^{(n-1)}, x, \ldots, x^{(n)}\big) \cdot x^{(j)} - \sum_{j=0}^{n-1} B_j \cdot F_{jy}\big(y, \ldots, y^{(n-1)}, x, \ldots, x^{(n)}\big) \cdot y^{(j)} + D. \qquad (57)$$

Taking into account all N measured data samples, to which the model with the
parameters $A_j$, $B_j$ and D must be fitted, we can present the system in the matrix form:

$$Y_n = XY \cdot PAR, \quad \text{where} \quad
Y_n = \begin{bmatrix} y_1^{(n)} \\ y_2^{(n)} \\ \vdots \\ y_N^{(n)} \end{bmatrix}, \quad
PAR = \begin{bmatrix} A_0 \\ \vdots \\ A_n \\ B_0 \\ \vdots \\ B_{n-1} \\ D \end{bmatrix}, \quad
XY = \begin{bmatrix}
F_{0x,1}\, x_1 & \cdots & F_{nx,1}\, x_1^{(n)} & -F_{0y,1}\, y_1 & \cdots & -F_{(n-1)y,1}\, y_1^{(n-1)} & 1 \\
F_{0x,2}\, x_2 & \cdots & F_{nx,2}\, x_2^{(n)} & -F_{0y,2}\, y_2 & \cdots & -F_{(n-1)y,2}\, y_2^{(n-1)} & 1 \\
\vdots & & \vdots & \vdots & & \vdots & \vdots \\
F_{0x,N}\, x_N & \cdots & F_{nx,N}\, x_N^{(n)} & -F_{0y,N}\, y_N & \cdots & -F_{(n-1)y,N}\, y_N^{(n-1)} & 1
\end{bmatrix}. \qquad (58)$$

The system (58) can be linear or non-linear with respect to some or all of the parameters
$A_j$, $B_j$ and D. In the non-linear case, non-linear optimisation has to be applied, for
instance the simplex search method. An initial estimate of the parameters can be obtained
either by omitting nonlinearities, if this is possible, or by using another estimation
procedure. It is advisable to try several sets of starting values to make sure that the
solution gives relatively consistent results. The result obtained for a given optimal
averaging width $d_{opt}$ of the measurement window g(v) does not have to be an optimum. The
result of the identification, that is the model parameters, can be verified by determining
the mean square deviation between the output signal y and the model response $y_s$ obtained
through simulation for the assigned parameters:

$$J = \sum_{i=1}^{N} (y_{s,i} - y_i)^2 = \| y_s - y \|^2. \qquad (59)$$

Both signals y and $y_s$ must be averaged and corrected. The minimum of the criterion J confirms
the result of the non-linear optimisation and the correctness of the choice of $d_{opt}$.
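
When the model is linear in its parameters, the matrix form (58) can be solved directly in the least-squares sense. The sketch below illustrates this for a hypothetical first-order special case of (57), $y^{(1)} = A_0 x - B_0 y^3 + D$ (that is, $F_{0x} = 1$ and $F_{0y} = y^2$). It assumes noise-free synthetic signals with an analytically known derivative in place of averaged and corrected measurements, and it solves the normal equations with a small Gaussian elimination written for this example; the non-linear case mentioned above is not covered.

```python
import math

def solve(a, b):
    """Gaussian elimination with partial pivoting for a small dense system a * x = b."""
    n = len(b)
    m = [row[:] + [b[i]] for i, row in enumerate(a)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(m[r][col]))
        m[col], m[pivot] = m[pivot], m[col]
        for r in range(col + 1, n):
            factor = m[r][col] / m[col][col]
            for c in range(col, n + 1):
                m[r][c] -= factor * m[col][c]
    x = [0.0] * n
    for r in range(n - 1, -1, -1):
        x[r] = (m[r][n] - sum(m[r][c] * x[c] for c in range(r + 1, n))) / m[r][r]
    return x

def least_squares(XY, Yn):
    """Least-squares solution of XY * PAR = Yn via the normal equations."""
    cols = len(XY[0])
    ata = [[sum(row[i] * row[j] for row in XY) for j in range(cols)] for i in range(cols)]
    atb = [sum(row[i] * y for row, y in zip(XY, Yn)) for i in range(cols)]
    return solve(ata, atb)

A0, B0, D = 2.0, 0.5, 0.3            # "true" parameters of the synthetic example
XY, Yn = [], []
for i in range(400):
    t = 0.05 * i
    y = math.sin(t) + 0.5 * math.sin(2.3 * t)
    dy = math.cos(t) + 1.15 * math.cos(2.3 * t)
    x = (dy + B0 * y**3 - D) / A0    # input consistent with the assumed model
    XY.append([x, -y**3, 1.0])       # one row of XY: [F_0x * x, -F_0y * y, 1]
    Yn.append(dy)

par = least_squares(XY, Yn)          # estimates of [A0, B0, D]
print(par)
```

With real data the rows of XY would be built from the averaged and corrected signals and derivatives, and the criterion (59) would then be used to check the fitted parameters by simulation.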

3.2 Nonparametric identification of nonlinear dynamics of 1st order of systems based
on the active experiment (Bockowska & Zuchowski, 2007)

Although the presented method is nonparametric, its usefulness is limited to system
dynamics with a structure described by the following differential equation:

$$y^{(1)} \cdot \varphi(x,y) + y \cdot f(x,y) = x. \qquad (60)$$

Because the measurement window is an even function, the method of averaged differentiation
does not introduce any time shift between the measured signals, for example $y^{(1)}(t_0)$, and
the averaged signals $y^{(1)}(t_0)_g$. The corrected signals are not shifted in relation to the
measured signals either, because the correction procedure is also even.

An active experiment ensures that a signal with a rich spectrum is fed into the system
input. A rich spectrum can be achieved when a periodic signal with modulated amplitude and
frequency is applied, for example:

$$x(t) = A(t) \cdot \sin(t \cdot \omega(t) + \phi). \qquad (61)$$

The averaged and corrected courses of the signals x(t), y(t) and $y^{(1)}(t)$ should be obtained
and plotted. The time moments for which $y^{(1)}(t) = 0$ can be easily found and correspond to
the points of the static characteristic of the examined object:

$$y \cdot f(x,y) = x. \qquad (62)$$

Next, the values of the second unknown function $\varphi(x,y)$ can be determined using all the
values of both averaged and corrected signals.

The goal of the presented method is not the determination of the static characteristic, but
that of the function f(x,y). These two can produce the same or similar graphs, but they are
not always the same mathematically. This means that the structure of the function f(x,y)
cannot be concluded from the relation y(x) without additional assumptions. The functions f
and $\varphi$ can be functions of x only, of y only, or of both signals together. Hence, various
models can be created and some of them differ from the true model. The regression method
should be used in order to determine the functions f and $\varphi$ on the basis of the plots of
the following relations:

$$f(x,y) = z_1 = \frac{x}{y}, \qquad \varphi(x,y) = z_2 = \frac{x - y \cdot f(x,y)}{y^{(1)}} \qquad (63)$$

in the co-ordinate systems x, y, $z_1$ and x, y, $z_2$. An input with a rich spectrum makes it
possible to obtain a large collection of different points x, y, $z_1$, $z_2$.
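
The two steps of this procedure can be sketched on synthetic data. The example below rests on several assumptions that are not part of the original method description: f and $\varphi$ depend on y only, the "true" functions f(y) = 1 + y² and $\varphi$ = 2 are hypothetical, the signals are noise-free with an analytically known derivative (standing in for averaged and corrected signals), and f is fitted on the assumed basis {1, y²}.

```python
import math

def f_true(y):
    """Hypothetical static nonlinearity used to generate the data."""
    return 1.0 + y * y

PHI_TRUE = 2.0   # hypothetical constant value of phi

def signals(t):
    """Amplitude-modulated output, its derivative, and the input that satisfies Eq. (60)."""
    amp = 0.2 + 0.05 * t
    y = 2.0 + amp * math.sin(t)
    dy = 0.05 * math.sin(t) + amp * math.cos(t)
    x = dy * PHI_TRUE + y * f_true(y)
    return x, y, dy

dt = 0.001
samples = [signals(i * dt) for i in range(20001)]

# Step 1: moments with y' = 0 (sign change, linear interpolation) give z1 = x / y.
z1_points = []
for (x0, y0, d0), (x1, y1, d1) in zip(samples, samples[1:]):
    if d0 * d1 < 0.0:
        w = d0 / (d0 - d1)
        xs, ys = x0 + w * (x1 - x0), y0 + w * (y1 - y0)
        z1_points.append((ys, xs / ys))

# Step 2: least-squares fit of z1 = c0 + c1 * y**2 (2 x 2 normal equations).
n = len(z1_points)
s2 = sum(y**2 for y, _ in z1_points)
s4 = sum(y**4 for y, _ in z1_points)
b0 = sum(z for _, z in z1_points)
b2 = sum(z * y**2 for y, z in z1_points)
det = n * s4 - s2 * s2
c0 = (b0 * s4 - b2 * s2) / det
c1 = (n * b2 - s2 * b0) / det

# Step 3: z2 = (x - y * f(y)) / y' wherever the derivative is not close to zero.
z2 = [(x - y * (c0 + c1 * y * y)) / dy for x, y, dy in samples if abs(dy) > 0.1]
print(c0, c1, min(z2), max(z2))
```

Because the amplitude of the output is modulated, the moments with zero derivative occur at different values of y, which is what makes the regression in the second step possible.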
The examined object can have dynamics of an order higher than one. In this case the
obtained static characteristic is different from the real one, and a change of the form of
the signal x(t) leads to other results. Hence, this is a good control test of the correctness
of the assumed order of the dynamics.

The identified model should of course be verified using numerical simulation.
A comparison between simulation results and measured data should also be carried out.

3.3 Nonparametric identification of nonlinear dynamics of 2nd order of systems

The consideration relates to models of system dynamics with the following structure:

$$y^{(2)} \cdot F_2(x,y,y^{(1)}) + y^{(1)} \cdot F_1(x,y,y^{(1)}) + F_0(x,y,y^{(1)}) = x. \qquad (64)$$

The averaged and corrected values of the signals x, y, $y^{(1)}$ and $y^{(2)}$ are used. First, the
relation corresponding to the static characteristic should be determined, that is the steady
state in which $y^{(1)}(t) = 0$, $y^{(2)}(t) = 0$ and x(t) = x = const.:

$$F_0(x,y,0) = x. \qquad (65)$$

Usually the function $F_0(x,y)$ is independent of $y^{(1)}(t)$, hence the substitute relation can be
obtained:

$$y = \Phi(x). \qquad (66)$$

Next, the values of x, y corresponding to the extreme values of the first derivative should be
chosen, for which:

$$y^{(2)}(t) = 0. \qquad (67)$$

Knowledge of the function $F_0(x,y)$, determined in the first step, together with multiple
values of $y^{(1)}_{ex}$ chosen in the second step, allows us to evaluate the structure of the
function $F_1(x,y,y^{(1)})$:

$$F_1(x,y,y^{(1)}_{ex}) = \frac{x - F_0(x,y)}{y^{(1)}_{ex}}. \qquad (68)$$

In a particular case, if $F_1(x,y,y^{(1)})$ is a function of the output only, the relation (68) is
obtained uniquely:

$$F_1(y) = \frac{x - F_0(x,y)}{y^{(1)}_{ex}}. \qquad (69)$$

In other cases additional conditions should be assumed a priori.

Finding the values of x, y and $y^{(2)}$ for $y^{(1)}(t) = 0$ allows us to choose the structure of the
last function:

$$F_2(x,y,0) = \frac{x - F_0(x,y)}{y^{(2)}}. \qquad (70)$$

The result of non-parametric identification is unique if the model of system dynamics can be
described by the following differential equation:

$$y^{(2)} \cdot F_2(y) + y^{(1)} \cdot F_1(y) + F_0(y) = x. \qquad (71)$$

The accuracy of this process is defined by the accuracy of the operation of averaged
differentiation with correction.

3.4 Nonparametric identification of nonlinear dynamics of order higher than 2nd

For systems whose model has a structure analogous to (71) but is of a higher order,
parametric identification is necessary. Let us now consider:

$$y^{(3)} \cdot F_3(y) + y^{(2)} \cdot F_2(y) + y^{(1)} \cdot F_1(y) + F_0(y) = x. \qquad (72)$$

After the determination of the static characteristic and the definition of the relation:

$$F_0(y) = x \qquad (73)$$

the values of x, y corresponding to the extreme values of the first or of the second
derivative should be chosen, because both of these conditions are fulfilled simultaneously
very rarely. Supposing that $y^{(3)}(t) = 0$ we obtain:

$$y^{(2)}_{ex} \cdot F_2(y) + y^{(1)} \cdot F_1(y) = x - F_0(y). \qquad (74)$$

The structure of the functions $F_2$ and $F_1$ can be assumed as follows:

$$F_2(y) = \sum_{i=0}^{n} a_i \cdot y^i, \qquad F_1(y) = \sum_{i=0}^{m} b_i \cdot y^i \qquad (75)$$

and obtained using regression. If, however, $y^{(2)}(t) = 0$, we obtain:

$$y^{(3)}_{ex} \cdot F_3(y) + y^{(1)} \cdot F_1(y) = x - F_0(y) \qquad (76)$$

and the function $F_1$ can be treated as determined. Consequently, the form of the function $F_3$
can be found. For models of a higher order the procedure can be analogous.

3.5 An optimal degree of complexity of a model 

In all the cases described above the analytical form of the functions $F_0$, $F_1$, ..., $F_k$ is
evaluated using the appropriate set of data, assumed base functions:

$$F_k(y) = \sum_{i=0} c_i \cdot f_i(y) \qquad (77)$$

and the regression method, remembering that the variables x and y are known with a certain
accuracy $\Delta x$ and $\Delta y$. If the measurement accuracy of the variables can be obtained, the
optimal degree of complexity of the model can be determined. Assume that the model with the
degree of complexity n in the form:

$$F_n(y) = \sum_{i=0}^{n} c_i \cdot y^i \qquad (78)$$

was determined by regression based on the minimisation of the error:

$$D^2(c_i) = \int \left\{ F(y) - \sum_{k=0}^{n} c_k \cdot y^k \right\}^2 dy, \qquad (79)$$

hence the following equations are satisfied for i = 0, 1, ..., n:

$$\frac{\partial D^2(c_i)}{\partial c_i} = 0 = -2 \int \left\{ F(y) - \sum_{k=0}^{n} c_k \cdot y^k \right\} \cdot y^i\,dy. \qquad (80)$$

If the values of y are known with the accuracy $\Delta y$ = const or $\Delta_1 = \Delta y / y$ = const, then
the application of the model (78) is connected with an additional error, which for small
$\Delta y$ is defined by the relation:

$$F_n(y + \Delta y) = F_n(y) + \Delta y \cdot \frac{dF_n(y)}{dy} = F_n(y) + \Delta y \cdot \sum_{i=0}^{n} i \cdot c_i \cdot y^{i-1} \qquad (81)$$

or

$$F_n(y + \Delta y) = F_n(y) + \Delta y \cdot \frac{dF_n(y)}{dy} = F_n(y) + \Delta_1 \cdot \sum_{i=0}^{n} i \cdot c_i \cdot y^{i}. \qquad (82)$$

The square error is as follows:

$$D_a^2(n) = (\Delta y)^2 \int \left\{ \sum_{i=0}^{n} i \cdot c_i \cdot y^{i-1} \right\}^2 dy \quad \text{or} \quad D_a^2(n) = \Delta_1^2 \int \left\{ \sum_{i=0}^{n} i \cdot c_i \cdot y^{i} \right\}^2 dy. \qquad (83)$$

The total model error:

$$D^2(n) = D_{\min}^2(n) + D_a^2(n) \qquad (84)$$

depends on its degree of complexity n. If, for the assumed $\Delta y$ or $\Delta_1$, the calculations
give $D^2(n+1) > D^2(n)$, then the degree of model complexity n is optimal.

4. Conclusion 

The presented method can be useful in the identification of the structure of a model of
nonlinear dynamics of 1st and higher orders. The advantage of the proposed solution is its
simplicity and the possibility of evaluating its accuracy. The application of averaged
differentiation with correction makes it possible to evaluate an averaged signal and its
derivatives close to their real values. The novelty is the proposed procedure for correcting
the averaged differentiation. The method makes it possible to obtain the structure of a
model of nonlinear dynamics of 1st or 2nd order. The advantage of this identification is
that it can be used even if the measured signals are disturbed and the dynamics are
nonlinear. For dynamics of a higher order, the solution is to apply parametric
identification to define a part of the model structure.
1. Introduction

Problems of determining group judgement have been widely explored and many methods
have been proposed (Nurmi, 1987). This is due to the fact that there is no "ideal" method
satisfying all the requirements formulated. Hence new methods, possessing desirable
properties and avoiding the deficiencies of the previous ones, have been developed.
To analyse and solve problems of determining group judgement efficiently, some
simplifying assumptions are usually introduced.
Very often it is assumed that no equivalent alternatives can appear in experts' judgements.
Such an assumption is adopted despite the fact that in real-life problems experts are not
always able to uniquely determine the order of alternatives with respect to the given
criterion or set of criteria.
Almost all the methods of determining group judgement are based on the assumption that
there are no ties in this judgement. This assumption seems to be more restrictive than the
previous one, and may strongly influence the solution obtained. This is especially important
in the case of distance-based methods, i.e. those making use of the concept of distance among
preference orders (Cook, 2006; Cook et al., 1997; Cook & Seiford, 1978).
In further considerations only experts' judgements given in the form of preference orders
are taken into account.
One of the methods making it possible to overcome the problem of ties is to use the
approach proposed by Armstrong et al. (1982).
It will be shown that some distance-based methods of group judgement derived for the case
of no ties can be extended to the case of ties in experts' opinions as well as in the group
judgement.

2. Positions taken by alternatives and the table of structures 

Assume that there is a set of n alternatives $O = \{O_1, \ldots, O_n\}$ and K experts, who are asked to
order this set with respect to a given criterion (or criteria). The experts present their judgements
in the form of preference orders

$$P^k = \{O_{i_1}, \ldots, O_{i_n}\}, \quad k = 1, \ldots, K, \tag{1}$$

where the alternative regarded as the best one takes the first position and the one regarded as
the worst takes the last position.
This preference order can also be written as a ranking:

$$P^k = \{q_1^k, \ldots, q_n^k\}, \tag{2}$$

where $q_i^k$ denotes the position taken by the i-th alternative in the k-th expert's judgement.
One can also assume that in an expert judgement, as well as in the group judgement, more
than one alternative can be put in the same position, i.e. ties can occur. In this case
a preference order may be given as follows

$$P^k = \{O_{i_1}, \ldots, (O_{i_p}, \ldots, O_{i_{p+r-1}}), \ldots, O_{i_n}\}, \tag{3}$$

where the r tied alternatives are given in brackets.



Example 1.

$P = \{O_2, (O_1, O_3, O_4), O_5\}$ may be written as $P = \{2, 1, 2, 2, 3\}$. We shall refer to this notation
as the classical one. It is sometimes called a dense ranking.

Cook & Seiford (1978) proposed to apply fractional ranking: when r alternatives are
tied in the same position, say p, it is assumed that they take the positions
(p, p+1, ..., p+r-1) and the fractional position assigned to them is the mean

$$t = \frac{p + (p+1) + \ldots + (p+r-1)}{r} = \frac{2p + (r-1)}{2} = p + \frac{r-1}{2}. \tag{4}$$

It should be emphasized that the expression obtained is of the form v+½ for any even r and
is an integer otherwise, where p, r, v are integers.

For the preference order given in Example 1 one has $P = \{3, 1, 3, 3, 5\}$.

Hence in the fractional notation alternatives can take positions from the following set:

$$t \in \{1, 1\tfrac{1}{2}, 2, 2\tfrac{1}{2}, 3, 3\tfrac{1}{2}, \ldots, n-1, n-\tfrac{1}{2}, n\}. \tag{5}$$



One should notice that there may be positions with no alternatives assigned to them.
The number of positions is almost doubled with respect to the classical notation (it equals 2n-1),
but for a given n it is fixed. This property is important for the construction of the table of
structures described later on. An example explaining the use of this notation is given in
Table 1 (Σ denotes the sum of the positions).



| Preference order | Positions, classical notation | Σ | Positions, fractional notation | Σ |
|---|---|---|---|---|
| $P^1$: $O_2, O_3, O_4, O_1, O_5$ | 4, 1, 2, 3, 5 | 15 | 4, 1, 2, 3, 5 | 15 |
| $P^2$: $(O_2, O_3), O_4, (O_1, O_5)$ | 3, 1, 1, 2, 3 | 10 | 4½, 1½, 1½, 3, 4½ | 15 |
| $P^3$: $O_2, (O_1, O_3, O_4), O_5$ | 2, 1, 2, 2, 3 | 10 | 3, 1, 3, 3, 5 | 15 |
| $P^4$: $O_2, (O_1, O_3, O_4, O_5)$ | 2, 1, 2, 2, 2 | 9 | 3½, 1, 3½, 3½, 3½ | 15 |

Table 1. Classical and fractional notation of positions of alternatives.
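
The two notations compared in Table 1 can be computed mechanically. The following is a minimal sketch, not taken from the chapter: it assumes a preference order is stored as a list of tie groups (best group first, alternatives numbered from 1), and the function names are illustrative only.

```python
from fractions import Fraction


def classical_positions(order, n):
    """Dense ranking: every alternative of the g-th tie group gets position g."""
    pos = [None] * n
    for g, group in enumerate(order, start=1):
        for alt in group:
            pos[alt - 1] = g
    return pos


def fractional_positions(order, n):
    """Fractional ranking (4): a group of r alternatives starting at p gets p + (r-1)/2."""
    pos = [None] * n
    p = 1  # classical position of the first alternative of the current group
    for group in order:
        r = len(group)
        t = Fraction(2 * p + r - 1, 2)
        for alt in group:
            pos[alt - 1] = t
        p += r
    return pos


# Preference orders P^2 and P^4 from Table 1.
P2 = [[2, 3], [4], [1, 5]]
P4 = [[2], [1, 3, 4, 5]]
```

Exact fractions are used so that half-integer positions compare without rounding. In the fractional notation the positions always sum to n(n+1)/2, which is the constant column Σ = 15 of Table 1.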

It can easily be shown that the number of equivalent alternatives taking a given position t
can vary.

Assume that the position is t=4 and consider the following preference orders of seven elements:

1. $(O_1, O_2, O_3, O_4, O_5, O_6, O_7)$, $t_1 = (1 + 2 + 3 + 4 + 5 + 6 + 7)/7 = 4$,

2. $O_1, (O_2, O_3, O_4, O_5, O_6), O_7$, $t_2 = (2 + 3 + 4 + 5 + 6)/5 = 4$,

3. $O_1, O_2, (O_3, O_4, O_5), O_6, O_7$, $t_3 = (3 + 4 + 5)/3 = 4$,

4. $O_1, O_2, O_3, O_4, O_5, O_6, O_7$, $t_4 = 4$.

Hence in all these preference orders the tied alternatives take the fourth position. To
specify the number of equivalent alternatives taking a given position one has to provide
additional information on the position (in the sense of (4)) taken by the first alternative from
the group of equivalent ones. This position is called the level and is denoted by $\ell$. It is evident
that $\ell = 1, \ldots, n$.

The level $\ell = 1$ is assigned to the group of equivalent alternatives whose first
alternative is located in the first position. The level $\ell = 2$ defines those groups of equivalent
alternatives for which the first alternative takes the second position, etc.
This approach makes it possible to define the concept of a structure.

A group of positions (in the sense of (4)) taken by equivalent alternatives is called a structure
and is denoted by $S_\ell^t$, because it depends on the position t as well as on the level $\ell$. The
number of positions corresponding to a given structure is denoted by $s_{\ell t}$. It is evident that
$1 \le s_{\ell t} \le n$. The concept introduced is, in the authors' opinion, more useful for solving the
problem under consideration than that given in (Armstrong et al., 1982).
The structures $S_\ell^t$ make it possible to define a table whose rows correspond to
structures related to a given level $\ell = 1, \ldots, n$ and whose columns correspond to a given position t = 1, 1½, ..., n.
In a preference order with ties the positions taken by alternatives are of the form "an integer or
an integer + ½". Hence it is easy to transform them into the integer domain, which is needed to
formulate the integer programming problem (Section 5): the numbers of positions are doubled
(T=2t). The numbers in the shaded area denote positions (in the classical sense) that can be
taken by alternatives.



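| T | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | 10 |
|---|---|---|---|---|---|---|---|---|---|
| t | 1 | 1½ | 2 | 2½ | 3 | 3½ | 4 | 4½ | 5 |
| $\ell = 1$ | 1 | (1,2) | (1,2,3) | (1,2,3,4) | (1,2,3,4,5) | (1,2,3,4,5,6) | (1,2,3,4,5,6,7) | (1,2,3,4,5,6,7,8) | (1,2,3,4,5,6,7,8,9) |
| $\ell = 2$ | | | 2 | (2,3) | (2,3,4) | (2,3,4,5) | (2,3,4,5,6) | (2,3,4,5,6,7) | (2,3,4,5,6,7,8) |
| $\ell = 3$ | | | | | 3 | (3,4) | (3,4,5) | (3,4,5,6) | (3,4,5,6,7) |
| $\ell = 4$ | | | | | | | 4 | (4,5) | (4,5,6) |
| $\ell = 5$ | | | | | | | | | 5 |

Table 2. The table of structures $S_\ell^t$ for n=5 and t = 1, 1½, ..., 5 (T = 2, ..., 10).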

The bottom edges of the shaded area, for integer t, correspond to preference orders with
no ties. A detailed description of the table of structures is given in (Bury &
Wagner, 2007a).

It should be emphasized that one of the structures from level 1 must appear in the
preference order. Moreover, it should be noted that for any level $\ell > 1$ only one structure, if
any, can be taken into account. The latter also holds for any position t. Note
that at a given level $\ell$ the alternatives can take the positions $T = 2\ell, \ldots, n+\ell$, and the
number of positions $s_{\ell T}$ to be taken into account for given $\ell$ and T is equal to

$$s_{\ell T} = T - 2\ell + 1 \quad \text{for } \underline{\ell}_T \le \ell \le \overline{\ell}_T, \tag{6}$$

where

$$\underline{\ell}_T = \max(1, T-n), \qquad \overline{\ell}_T = [T/2]. \tag{7}$$

The number of levels at which alternatives can be placed depends on the number n of
alternatives considered and on the position T.



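| T | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | 10 |
|---|---|---|---|---|---|---|---|---|---|
| $\underline{\ell}_T$ | 1 | 1 | 1 | 1 | 1 | 2 | 3 | 4 | 5 |
| $\overline{\ell}_T$ | 1 | 1 | 2 | 2 | 3 | 3 | 4 | 4 | 5 |

Table 3. The levels that should be taken into account for subsequent positions T for n=5.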

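For given T, $\ell$ and n one can determine the structures and the number of positions to be
considered.

| T | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | 10 |
|---|---|---|---|---|---|---|---|---|---|
| t | 1 | 1½ | 2 | 2½ | 3 | 3½ | 4 | 4½ | 5 |
| $\ell = 1$ | 1 | 2 | 3 | 4 | 5 | | | | |
| $\ell = 2$ | | | 1 | 2 | 3 | 4 | | | |
| $\ell = 3$ | | | | | 1 | 2 | 3 | | |
| $\ell = 4$ | | | | | | | 1 | 2 | |
| $\ell = 5$ | | | | | | | | | 1 |

Table 4. Values of $s_{\ell T}$ for n=5.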
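
Tables 3 and 4 follow directly from the relations (6) and (7), so they can be regenerated for any n. The sketch below is only an illustration; the function names and the convention of returning 0 for an inadmissible cell are assumptions made here, not part of the chapter.

```python
def level_bounds(T, n):
    """Lowest and highest level admissible at the doubled position T, as in (7)."""
    return max(1, T - n), T // 2


def structure_size(level, T, n):
    """Number of positions s of the structure at `level` and doubled position T, as in (6).

    Returns 0 when the level is outside the admissible range (an empty table cell).
    """
    lowest, highest = level_bounds(T, n)
    if lowest <= level <= highest:
        return T - 2 * level + 1
    return 0


n = 5
columns = range(2, 2 * n + 1)  # doubled positions T = 2, ..., 2n

# Rows of Table 3: lower and upper level bound for each T.
table3 = [level_bounds(T, n) for T in columns]

# Rows of Table 4: s for each level (outer list) and each T (inner list).
table4 = [[structure_size(level, T, n) for T in columns] for level in range(1, n + 1)]
```

For n = 5 the non-zero entries of `table4` coincide with the entries of Table 4, and the two components of `table3` with the two rows of Table 3.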

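For the preference order $P^1 = \{O_2, O_3, O_4, O_1, O_5\}$ the table of structures is as follows

| t | 1 | 1½ | 2 | 2½ | 3 | 3½ | 4 | 4½ | 5 |
|---|---|---|---|---|---|---|---|---|---|
| $\ell = 1$ | $O_2$ | | | | | | | | |
| $\ell = 2$ | | | $O_3$ | | | | | | |
| $\ell = 3$ | | | | | $O_4$ | | | | |
| $\ell = 4$ | | | | | | | $O_1$ | | |
| $\ell = 5$ | | | | | | | | | $O_5$ |

Table 5. Positions and levels of alternatives for the preference order $P^1$.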



3. Determining group opinion 

Methods of group judgement based on the concept of distance consist in determining
a preference order P that is the closest one, in the sense of the distance assumed, to the
given set of expert judgements presented in the form of preference orders. In general one
has to solve the following problem

$$\min \sum_{k=1}^{K} d(P^k, P) \rightarrow P. \tag{8}$$

This problem can be solved by searching over the set of all the preference orders
that can be regarded as a group judgement and choosing the one (or ones) that minimizes the
distance assumed. However, this approach is limited by the total number $s_n$ of preference
orders to be considered. It grows fast with n and equals

$$s_n = \sum_{k=1}^{n} k!\, S_n^k, \tag{9}$$

where $S_n^k$ is the Stirling number of the second kind (Lipski & Marek, 1985).
This number can be estimated (Bailey, 1998) as

$$s_{n+1} \approx \frac{n+1}{\log 2}\, s_n, \qquad s_{n+1} \approx 1.442695\,(n+1)\, s_n. \tag{10}$$
| Number of alternatives | Number of preference orders, no ties | $s_n$, total number of preference orders |
|---|---|---|
| 3 | 6 | 13 |
| 4 | 24 | 75 |
| 5 | 120 | 541 |
| 6 | 720 | 4 683 |
| 7 | 5 040 | 47 293 |
| 8 | 40 320 | 545 835 |
| 9 | 362 880 | 7 087 261 |
| 10 | 3 628 800 | 102 247 563 |

Table 6. Number of preference orders to be considered with respect to the number of
alternatives with no ties or with ties allowed.
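
The last column of Table 6 can be reproduced by summing k! times the Stirling numbers of the second kind over the number k of tie groups. The sketch below uses the standard recurrence S(n, k) = k·S(n-1, k) + S(n-1, k-1); the function names are illustrative and are not taken from the chapter.

```python
from math import factorial


def stirling2(n, k):
    """Stirling number of the second kind, built row by row from the recurrence."""
    table = [[0] * (k + 1) for _ in range(n + 1)]
    table[0][0] = 1
    for i in range(1, n + 1):
        for j in range(1, min(i, k) + 1):
            table[i][j] = j * table[i - 1][j] + table[i - 1][j - 1]
    return table[n][k]


def weak_orders(n):
    """Total number of preference orders of n alternatives when ties are allowed."""
    return sum(factorial(k) * stirling2(n, k) for k in range(1, n + 1))


def strict_orders(n):
    """Number of preference orders without ties."""
    return factorial(n)


# Ratio of consecutive totals, to compare with the estimate 1.442695 * (n + 1).
ratio_10_over_9 = weak_orders(10) / weak_orders(9)
```

For n = 9 the ratio is already close to 1.442695 · 10, which illustrates why exhaustive search becomes impractical as n grows.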

For n=17 the total number of preference orders to be considered is an 18-digit
number. Hence, instead of searching through all the preference orders, it is worth applying
a more specialized approach, e.g. formulating a constrained optimization problem which can
be regarded as a modification of the linear assignment problem.

If ties are allowed in the group judgement, then the problem has to be modified according to
the table of structures related to the problem under consideration. Moreover, some
additional constraints have to be added. The optimization problem with ties was investigated
by Cook & Seiford (1978), Armstrong et al. (1982) and Bury & Wagner (2007a, b).
The framework described in Section 4 was applied to formulate and solve the problem of
determining the Cook-Seiford median, the Litvak median and the Kemeny median.
The results are presented in Section 5. For the first two problems the distance is defined with
respect to the positions of alternatives. For the Kemeny median the distance is based on
pairwise comparisons of alternatives and differs significantly from the previous ones.
However, the general framework formulated can be applied in all the cases mentioned.
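4. Framework for preference structure description

The following binary variables are introduced:

$$y_{iT}^{\ell} = \begin{cases} 1 & \text{if } O_i \text{ takes the position } T \text{ at the level } \ell \text{ in the preference order } P \\ 0 & \text{otherwise} \end{cases} \tag{11}$$

$$y_{iT} = \begin{cases} 1 & \text{if } O_i \text{ takes the position } T \text{ in the preference order } P \\ 0 & \text{otherwise} \end{cases} \tag{12}$$

$$Y_{\ell T} = \begin{cases} 1 & \text{if the structure } S_\ell^T \text{ appears in the preference order } P \\ 0 & \text{otherwise} \end{cases} \tag{13}$$

$$\Lambda_\ell = \begin{cases} 1 & \text{if a structure from the level } \ell \text{ appears in the preference order } P \\ 0 & \text{otherwise} \end{cases} \tag{14}$$

It follows from the table of structures that for a given position T structures can be chosen from
levels $\ell$ such that $\underline{\ell}_T \le \ell \le \overline{\ell}_T$, where $\underline{\ell}_T$, $\overline{\ell}_T$ are given by (7), and

$$y_{iT} = \sum_{\ell = \underline{\ell}_T}^{\overline{\ell}_T} y_{iT}^{\ell}. \tag{15}$$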

The constraints are as follows:

$$\forall_{i=1,\ldots,n} \quad \sum_{T=2}^{2n} y_{iT} = 1, \tag{16}$$

i.e. a given alternative can be placed in one position only.

$$\forall_{T=2,\ldots,2n} \ \forall_{\ell = \underline{\ell}_T, \ldots, \overline{\ell}_T} \quad \sum_{i=1}^{n} y_{iT}^{\ell} = s_{\ell T}\, Y_{\ell T}, \tag{17}$$

i.e. the number of alternatives that can be placed in a given position T on a given level $\ell$ is
equal to zero or $s_{\ell T}$, where $s_{\ell T}$ (6) is the number of positions corresponding to a given
structure $S_\ell^T$.

$$\forall_{i=1,\ldots,n} \ \forall_{T=2,\ldots,2n} \quad \sum_{\ell = \underline{\ell}_T}^{\overline{\ell}_T} y_{iT}^{\ell} \le 1, \tag{18}$$

i.e. for a given position an alternative can be placed at one level at most.

$$\forall_{\ell=1,\ldots,n} \quad \Lambda_\ell = \sum_{T=2\ell}^{n+\ell} Y_{\ell T}, \tag{19}$$

i.e. for a given level $\ell$, $\ell > 1$, only one structure, if any, can be taken into account.

Since one of the structures from the level $\ell = 1$ must appear in the preference order, $\Lambda_1 = 1$ and

$$\forall_{\ell=2,\ldots,n} \quad \Lambda_\ell = \sum_{\lambda=1}^{\ell-1} Y_{\lambda,\,\ell+\lambda-1}, \tag{20}$$

i.e. the possibility of the occurrence of an $\ell$-level structure is determined by the structures
chosen on the preceding levels.

Note that if for any reason no tied alternatives should occur in the group
judgement, it suffices to impose $\Lambda_\ell = 1$ for $\ell = 1, \ldots, n$ in (20).

5. Optimization problem 

As mentioned in Section 3, the group judgement based on the concept of distance is
derived as the solution of the optimization problem (8). The solution obtained depends
upon the assumed definition of the distance as well as on the form of the preference order P
searched for.

The median distance is understood as an $l_1$ norm defined in an adequate space. Three cases
are considered:

• the space of positions taken by alternatives for the Cook-Seiford median,

• the space of preference vectors for the Litvak median,

• the space of pairwise comparisons for the Kemeny median.
This problem will be analyzed for two cases:

a) there are no ties in expert judgements or in the group judgement (complete ordering),

b) tied alternatives can occur both in expert judgements and in the group judgement (weak ordering).

The CPLEX optimization software was used to solve the discrete optimization problems
formulated.

5.1 No ties case

For the case of no ties in expert opinions as well as in the group judgement the optimization
problem (8) can be formulated as follows (Cook & Seiford, 1978; Hwang & Lin, 1987; Nurmi,
1987; Bury & Wagner, 2000):
minimize the distance defined subject to the following constraints:

• any given alternative $O_i$ (i=1, ..., n) can be placed in one position only, (21)

• the number of alternatives that can be placed in a given position is equal to one. (22)
To avoid ambiguity it is assumed that the positions the alternatives can take in experts'
judgements are denoted by j, j=1, ..., n.

The assumption of no ties in expert judgements will be relaxed for the Litvak and for the
Kemeny median.

5.1.1 The Cook-Seiford median 

Definition (Cook & Seiford, 1978)

The distance between two preference orders is defined in terms of the positions taken by
alternatives

$$d(P^{k_1}, P^{k_2}) = \sum_{i=1}^{n} |q_i^{k_1} - q_i^{k_2}|, \tag{23}$$

where $q_i^k$ denotes the position taken by the i-th alternative in the k-th expert's opinion.

It can be shown that the distance defined in such a way satisfies all the axioms describing
the measure of closeness (Cook & Seiford, 1978; Cook, 2006).
The distance of a preference order P from the set of experts' judgements $\{P^k\}$ is as follows

$$d(P, P^{(k)}) = \sum_{k=1}^{K} d(P, P^k) = \sum_{k=1}^{K} \sum_{i=1}^{n} |q_i^k - q_i|, \tag{24}$$

where $q_i$ denotes the position of the alternative $O_i$ in the preference order P.
The optimization problem can be formulated as follows.
Find a preference order P such that

$$d(P, P^{(k)}) = \min \sum_{k=1}^{K} d(P^k, P) = \min \sum_{k=1}^{K} \sum_{i=1}^{n} |q_i^k - q_i|. \tag{25}$$

If one assumes that the alternative $O_i$ can take the j-th position in the preference order P
(j = 1, ..., n), the distance (24) can be written in the form

$$d(P, P^{(k)}) = \sum_{i=1}^{n} \sum_{j=1}^{n} d_{ij}\, y_{ij}, \tag{26}$$

where

$$d_{ij} = \sum_{k=1}^{K} |q_i^k - j| \tag{27}$$

and $y_{ij}$ is defined as in (12).

The matrix of the coefficients $d_{ij}$ is denoted by D. The coefficient $d_{ij}$ expresses the aggregated difference
between the position of the i-th alternative in the preference order P and its positions in the
preference orders $P^k$, k=1, ..., K. The distance matrix D is of the form



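$$D = \begin{array}{c|cccc}
 & j=1 & j=2 & \cdots & j=n \\ \hline
O_1 & d_{11} & d_{12} & \cdots & d_{1n} \\
O_2 & d_{21} & d_{22} & \cdots & d_{2n} \\
\vdots & \vdots & \vdots & & \vdots \\
O_n & d_{n1} & d_{n2} & \cdots & d_{nn}
\end{array} \tag{28}$$

The lower bound of the distance (26) is equal to

$$C = \sum_{i=1}^{n} d_i^{\min}, \quad \text{where } d_i^{\min} = \min[d_{i1}, \ldots, d_{in}]. \tag{29}$$

The value of C is important for estimating how close the group judgement derived is to
an "ideal" judgement.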

The minimization problem (25) can be rewritten as a linear assignment problem

$$\min_{y_{ij}} \sum_{i=1}^{n} \sum_{j=1}^{n} d_{ij}\, y_{ij} \tag{30}$$

subject to

$$\forall_{i=1,\ldots,n} \quad \sum_{j=1}^{n} y_{ij} = 1, \tag{31}$$

i.e. an alternative can be placed in one position only, and

$$\forall_{j=1,\ldots,n} \quad \sum_{i=1}^{n} y_{ij} = 1, \tag{32}$$

i.e. only one alternative can be placed in a given position.

The latter constraint can easily be derived from (17) in the structure framework.

Summing (17) over $\ell$, $\ell = \underline{\ell}_T, \ldots, \overline{\ell}_T$, and taking into account (15) one obtains

$$\forall_{T=2,\ldots,2n} \quad \sum_{\ell=\underline{\ell}_T}^{\overline{\ell}_T} \sum_{i=1}^{n} y_{iT}^{\ell} = \sum_{i=1}^{n} \sum_{\ell=\underline{\ell}_T}^{\overline{\ell}_T} y_{iT}^{\ell} = \sum_{i=1}^{n} y_{iT} = \sum_{\ell=\underline{\ell}_T}^{\overline{\ell}_T} s_{\ell T}\, Y_{\ell T}. \tag{33}$$

For the case of no ties one has T=2, 4, ..., 2n (j=1, 2, ..., n).

From the table of structures (Table 5) it can be seen that $s_{jj} = 1$ and $Y_{jj} = 1$. Finally one
gets

$$\forall_{j=1,\ldots,n} \quad \sum_{i=1}^{n} y_{ij} = 1.$$

This problem was described in detail e.g. in Bury & Wagner (2000) and Cook (2006).

5.1.2 The Litvak median 

Litvak (1982) introduced the notion of the so-called preference vector. For the preference order
given by the k-th expert the preference vector is defined as follows:

$$\pi^k = [\pi_1^k, \ldots, \pi_n^k], \tag{34}$$

where $\pi_i^k$ is equal to the number of alternatives preceding the i-th one in this preference
order.

Under the assumptions made

$$\pi_i^k = q_i^k - 1, \tag{35}$$

where $q_i^k$ denotes the position taken by the i-th alternative in the k-th expert's opinion.

Example 3.

For the preference order $P^1$ from Table 1, $P^1 = \{O_2, O_3, O_4, O_1, O_5\}$, one has $\pi^1 = [3, 0, 1, 2, 4]$.

The distance between preference orders is expressed in terms of the corresponding preference
vectors.

Definition (Litvak, 1982).

Given two preference vectors $\pi^{k_1}$ and $\pi^{k_2}$ of the preference orders $P^{k_1}$ and $P^{k_2}$ respectively,
the distance between these two preference orders is defined as

$$d(P^{k_1}, P^{k_2}) = \sum_{i=1}^{n} |\pi_i^{k_1} - \pi_i^{k_2}|. \tag{36}$$

It can be shown that the distance defined in such a way satisfies all the axioms describing
the measure of closeness (Litvak, 1982).
Definition (Litvak, 1982).

The distance of a preference order P from the set of experts' preference orders $\{P^k\}$ is as
follows

$$d(P, P^{(k)}) = \sum_{k=1}^{K} \sum_{i=1}^{n} |\pi_i - \pi_i^k|. \tag{37}$$

Let

$$h_{ik}^{(j)} = |\pi_i^{P(j)} - \pi_i^k|, \quad i=1,\ldots,n;\ j=1,\ldots,n;\ k=1,\ldots,K, \tag{38}$$

where $\pi_i^{P(j)}$ is the number of alternatives preceding $O_i$ if $O_i$ takes the j-th position in the
preference order P.

Summing $h_{ik}^{(j)}$ (38) over k (k=1,...,K) one obtains

$$h_i^{(j)} = \sum_{k=1}^{K} h_{ik}^{(j)}. \tag{39}$$

The distance (37) may be written as follows

$$d(P, P^{(k)}) = \sum_{i=1}^{n} \sum_{j=1}^{n} \sum_{k=1}^{K} |\pi_i^{P(j)} - \pi_i^k|\, y_{ij} = \sum_{i=1}^{n} \sum_{j=1}^{n} h_i^{(j)}\, y_{ij}, \tag{40}$$

where $y_{ij}$ is defined by (12).

The matrix of the coefficients $h_i^{(j)}$ is denoted by H; $h_i^{(j)}$ expresses the aggregated difference
between the position of the i-th alternative in the preference order P and its positions in the
preference orders $P^k$, k=1, ..., K.

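The distance matrix H is of the form

$$H = \begin{array}{c|cccc}
 & j=1 & j=2 & \cdots & j=n \\ \hline
O_1 & h_1^{(1)} & h_1^{(2)} & \cdots & h_1^{(n)} \\
O_2 & h_2^{(1)} & h_2^{(2)} & \cdots & h_2^{(n)} \\
\vdots & \vdots & \vdots & & \vdots \\
O_n & h_n^{(1)} & h_n^{(2)} & \cdots & h_n^{(n)}
\end{array} \tag{41}$$

The lower bound of the distance (40) is equal to (Litvak, 1982)

$$G = \sum_{i=1}^{n} h_i^{\min}, \quad \text{where } h_i^{\min} = \min[h_i^{(1)}, \ldots, h_i^{(n)}]. \tag{42}$$

The problem of determining the Litvak median can be formulated as the following binary
optimization problem (Litvak, 1982; Bury & Wagner, 2000)

$$\min_{y_{ij}} \sum_{i=1}^{n} \sum_{j=1}^{n} h_i^{(j)}\, y_{ij} \tag{43}$$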



subject to (31)-(32).
One should note that when tied alternatives can occur in expert judgements, the
definition of the preference vector makes it easy to determine its components and the distance
matrix. The preference vector for an expert judgement with ties has the same form for the classical
as well as for the fractional position system. Hence it may be applied in the optimization
problem (43).

Example 2.

For the preference order $P^2$ from Table 1, $P^2 = \{(O_2, O_3), O_4, (O_1, O_5)\}$, one has $\pi^2 = [3, 0, 0, 2, 3]$.
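
The preference vectors quoted for $P^1$ and $P^2$ can be obtained by counting, for each tie group, the alternatives placed ahead of it. The sketch below is illustrative only: the list-of-groups representation and the function names are assumptions made here, not the authors' notation.

```python
def preference_vector(order, n):
    """Litvak preference vector: entry i-1 is the number of alternatives preceding O_i.

    `order` is a list of tie groups, best group first; alternatives are numbered 1..n.
    """
    pi = [0] * n
    preceding = 0  # alternatives placed in earlier groups
    for group in order:
        for alt in group:
            pi[alt - 1] = preceding
        preceding += len(group)
    return pi


def litvak_distance(order_a, order_b, n):
    """Distance (36): the l1 distance between the two preference vectors."""
    a = preference_vector(order_a, n)
    b = preference_vector(order_b, n)
    return sum(abs(x - y) for x, y in zip(a, b))


# Preference orders P^1 (no ties) and P^2 (with ties) from Table 1.
P1 = [[2], [3], [4], [1], [5]]
P2 = [[2, 3], [4], [1, 5]]
```

Because tied alternatives share the same count of predecessors, the same function serves for judgements with and without ties.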

5.1.3 The Kemeny median 

The distance for the Kemeny median method is defined with the use of pairwise comparison
matrices (Kemeny, 1959; Kemeny & Snell, 1960).

For a given preference order of n alternatives presented by the k-th expert (k=1,...,K),
$P^k: O_{i_1}, O_{i_2}, \ldots, O_{i_n}$, the matrix of pairwise comparisons can be constructed as follows (e.g.
Litvak, 1982); it is assumed that ties can occur in expert judgements:

$$A^k = \begin{bmatrix} a_{11}^k & \cdots & a_{1n}^k \\ \vdots & & \vdots \\ a_{n1}^k & \cdots & a_{nn}^k \end{bmatrix}, \quad \text{where } a_{il}^k = \begin{cases} 1 & \text{for } O_i \succ O_l \\ 0 & \text{for } O_i \approx O_l \\ -1 & \text{for } O_i \prec O_l \end{cases} \tag{44}$$

i=1, ..., n, l=1, ..., n.

The notation $O_i \succ O_l$ ($O_i \approx O_l$) should be read as follows: the i-th alternative $O_i$ is better
than (equivalent to) the l-th alternative $O_l$ with respect to a chosen criterion (a set of criteria).
Assume that two preference orders $P^{k_1}$ and $P^{k_2}$ are given. The distance between these two
preference orders is defined as follows (e.g. Litvak, 1982; Bury & Wagner, 2000)

$$d(P^{k_1}, P^{k_2}) = \frac{1}{2} \sum_{i=1}^{n} \sum_{l=1}^{n} |a_{il}^{k_1} - a_{il}^{k_2}|. \tag{45}$$

It can be shown that the distance defined in such a way satisfies all the axioms describing
the measure of closeness (Litvak, 1982).
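
The pairwise comparison matrix (44) and the distance (45) can be sketched as follows. This is an illustration under the assumption that a preference order is stored as a list of tie groups, best group first; the function names are hypothetical.

```python
def comparison_matrix(order, n):
    """Matrix (44): entry [i][l] is 1 if O_(i+1) is better than O_(l+1), 0 if tied, -1 if worse."""
    group_index = [0] * n
    for g, group in enumerate(order):
        for alt in group:
            group_index[alt - 1] = g
    # A smaller group index means a better alternative.
    return [
        [(group_index[l] > group_index[i]) - (group_index[l] < group_index[i]) for l in range(n)]
        for i in range(n)
    ]


def kemeny_distance(order_a, order_b, n):
    """Distance (45): half of the entrywise l1 distance between the comparison matrices."""
    a = comparison_matrix(order_a, n)
    b = comparison_matrix(order_b, n)
    total = sum(abs(a[i][l] - b[i][l]) for i in range(n) for l in range(n))
    # Both matrices are antisymmetric, so the total is always even.
    return total // 2
```

With this definition, swapping two adjacent alternatives changes the distance by 2, while merging them into a tie changes it by 1.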

The distance between a given preference order P and a set of preference orders given by the
experts is defined as

$$d = d(P, P^{(k)}) = \frac{1}{2} \sum_{i=1}^{n} \sum_{l=1}^{n} \sum_{k=1}^{K} |a_{il}^k - a_{il}|. \tag{46}$$

The following equality holds

$$|a_{il}^k - a_{il}| + |a_{li}^k - a_{li}| = 2\,|a_{il}^k - a_{il}| = 2\,|a_{li}^k - a_{li}|, \tag{47}$$

so the expression (46) can be rewritten as follows

$$d = \sum_{(i,l) \in I_P^{(1)}} \sum_{k=1}^{K} |a_{il}^k - a_{il}| + \sum_{(i,l) \in I_P^{(2)}} \sum_{k=1}^{K} |a_{il}^k - a_{il}|, \tag{48}$$

where

$I_P^{(1)}$ is the set of indices (i,l) for which $O_i \succ O_l$ in the preference order P, i.e. $a_{il} = 1$, (49)

$I_P^{(2)}$ is the set of indices (i,l) for which $O_i \approx O_l$ in the preference order P, i.e. $a_{il} = 0$. (50)

Assume that in the preference order P $O_i \succ O_l$, i.e. $a_{il} = 1$. In order to determine the
distance of this preference order from a given set $\{P^k\}$, use can be made of the coefficients
defined as follows

$$r_{il} = \sum_{k=1}^{K} d_{il}(P, P^k) = \sum_{k=1}^{K} |a_{il}^k - a_{il}| = \sum_{k=1}^{K} |a_{il}^k - 1|. \tag{51}$$

They are called the loss coefficients and the matrix $R = [r_{il}]$ is called the loss matrix
(Litvak, 1982). It is assumed that $r_{il} = 0$ for all i=l. It should be noted that the elements of the
matrix R depend only upon the form of the preference orders $P^k$ (k=1,...,K) and are independent
of the position system assumed (classical or fractional, if applicable).
Making use of the coefficients $r_{il}$ (i,l=1,...,n), formula (48) can be rewritten in the
following form (Bury & Wagner, 2000; Litvak, 1982):

$$d = \sum_{(i,l) \in I_P^{(1)}} r_{il} + \sum_{(i,l) \in I_P^{(2)}} \sum_{k=1}^{K} |a_{il}^k|. \tag{52}$$

If ties are not allowed in the group judgement, then $I_P^{(2)} = \{(i,i)\}$, i=1,...,n. In this case we have
(Litvak, 1982)

$$d = \sum_{(i,l) \in I_P^{(1)}} r_{il}. \tag{53}$$

This formula can be used to determine the lower bound E of the distance (53). Litvak has
shown that in the case under consideration the following theorem holds.
Theorem (Litvak, 1982).

$$E = \sum_{i=1}^{n-1} \sum_{l=i+1}^{n} \min(r_{il}, r_{li}). \tag{54}$$

These results make it possible to propose some heuristic algorithms for determining the
Kemeny median (Bury & Wagner, 2000; Litvak, 1982).
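
The loss matrix, the lower bound of the theorem above and the no-ties distance can be tried out on a small case. The sketch below uses exhaustive search over strict orders, which is only practical for small n and is not one of the heuristic algorithms referred to in the text; the three-expert data set and all function names are hypothetical.

```python
from itertools import permutations


def comparison_matrix(order, n):
    """Pairwise comparison matrix: 1 if O_(i+1) is better than O_(l+1), 0 if tied, -1 if worse."""
    group_index = [0] * n
    for g, group in enumerate(order):
        for alt in group:
            group_index[alt - 1] = g
    return [
        [(group_index[l] > group_index[i]) - (group_index[l] < group_index[i]) for l in range(n)]
        for i in range(n)
    ]


def loss_matrix(experts, n):
    """Loss coefficients: sum over experts of |a_il^k - 1|, with zeros on the diagonal."""
    mats = [comparison_matrix(order, n) for order in experts]
    return [
        [0 if i == l else sum(abs(a[i][l] - 1) for a in mats) for l in range(n)]
        for i in range(n)
    ]


def lower_bound(r):
    """Lower bound E: for every pair take the cheaper of the two possible orientations."""
    n = len(r)
    return sum(min(r[i][l], r[l][i]) for i in range(n) for l in range(i + 1, n))


def kemeny_median_no_ties(r):
    """Exhaustive search over strict orders; the distance sums r over pairs (ahead, behind)."""
    n = len(r)

    def dist(perm):
        return sum(r[perm[a]][perm[b]] for a in range(n) for b in range(a + 1, n))

    best = min(permutations(range(n)), key=dist)
    return [i + 1 for i in best], dist(best)


# Hypothetical toy data: two experts rank O1, O2, O3; the third swaps O1 and O2.
EXPERTS = [[[1], [2], [3]], [[1], [2], [3]], [[2], [1], [3]]]
R = loss_matrix(EXPERTS, 3)
```

In this toy case the bound E is attained, because choosing the cheaper orientation for every pair happens to give a consistent order; in general the bound may be strictly smaller than the optimum.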
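The distance (53) can be written in the form

$$d(P, P^{(k)}) = \sum_{i=1}^{n} \sum_{l=1}^{n} r_{il}\, x_{il}, \tag{55}$$

where

$$x_{il} = \begin{cases} 1 & \text{for } O_i \succ O_l \text{ in the preference order } P \\ 0 & \text{otherwise.} \end{cases} \tag{56}$$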



The problem of determining a group judgement that minimizes the distance (55) can be 
formulated as follows 



Group Judgement with Ties. Distance-Based Methods 






$$\min_{x_{il}} \sum_{i=1}^{n} \sum_{l=1}^{n} r_{il}\, x_{il}. \tag{57}$$

It should be emphasized that this problem (even if ties are not allowed in the group judgement)
cannot be solved within the framework presented in this subsection. It needs to be
formulated within the framework of structures presented for the case of ties.

5.2 The case of ties in group judgement 

For this case the fractional notation is used, i.e.

$$t \in \{1, 1\tfrac{1}{2}, 2, 2\tfrac{1}{2}, 3, 3\tfrac{1}{2}, \ldots, n-1, n-\tfrac{1}{2}, n\}.$$

5.2.1 The Cook-Seiford median 

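The problem (30)-(32) has to be modified (doubled positions T are applied).

The distance matrix D is of the form

$$D = \begin{array}{c|ccccc}
 & T=2 & T=3 & T=4 & \cdots & T=2n \\ \hline
O_1 & d_{12} & d_{13} & d_{14} & \cdots & d_{1,2n} \\
O_2 & d_{22} & d_{23} & d_{24} & \cdots & d_{2,2n} \\
\vdots & \vdots & \vdots & \vdots & & \vdots \\
O_n & d_{n2} & d_{n3} & d_{n4} & \cdots & d_{n,2n}
\end{array} \tag{58}$$

where

$$d_{iT} = \sum_{k=1}^{K} |2 q_i^k - T| \tag{59}$$

and $q_i^k$ is the fractional position taken by the alternative $O_i$ in the preference
order $P^k$ given by the k-th expert.
The optimization problem is as follows

$$\min_{y_{iT}} \sum_{i=1}^{n} \sum_{T=2}^{2n} d_{iT}\, y_{iT} \tag{60}$$

with the constraints (15)-(20).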

5.2.2 The Litvak median 

For the case of ties it is worth noticing that the number of alternatives preceding a given i-th
one in the preference order is determined by the level $\ell$ this alternative takes in the table of
structures and equals ($\ell$-1). This means that alternatives from the first level are preceded by
zero alternatives, alternatives from the second level are preceded by one alternative, etc.

The following example illustrates this observation.

Example 3.

For the preference order $P^2$ from Table 1, $P^2 = \{(O_2, O_3), O_4, (O_1, O_5)\}$, one has

$P^2 = \{4½, 1½, 1½, 3, 4½\}$.

Hence

$(O_2, O_3)$ are placed in the positions 1½, 1½ on the level $\ell = 1$; evidently no
alternatives precede them, $\pi_2^2 = 0$, $\pi_3^2 = 0$,

$O_4$ takes the 3rd position on the level $\ell = 3$; it is preceded by two alternatives, $\pi_4^2 = 2$,

$(O_1, O_5)$ are placed in the positions 4½, 4½ on the level $\ell = 4$; they are preceded by three
alternatives, $\pi_1^2 = 3$, $\pi_5^2 = 3$.
It may be shown that the components of a preference vector for a respective preference
order P may be determined as follows

$$\pi^{P(\ell)} = (\ell - 1)\,\Lambda_\ell, \quad \ell = 1, \ldots, n, \tag{61}$$

where $\Lambda_\ell$ is given by (14).

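The preference order $P^2 = \{(O_2, O_3), O_4, (O_1, O_5)\}$ presented with the use of the table of
structures is as follows

| | T | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | 10 |
|---|---|---|---|---|---|---|---|---|---|---|
| | t | 1 | 1½ | 2 | 2½ | 3 | 3½ | 4 | 4½ | 5 |
| $\Lambda_1 = 1$ | $\ell = 1$ | | $(O_2, O_3)$ | | | | | | | |
| $\Lambda_2 = 0$ | $\ell = 2$ | | | | | | | | | |
| $\Lambda_3 = 1$ | $\ell = 3$ | | | | | $O_4$ | | | | |
| $\Lambda_4 = 1$ | $\ell = 4$ | | | | | | | | $(O_1, O_5)$ | |
| $\Lambda_5 = 0$ | $\ell = 5$ | | | | | | | | | |

Table 7. Positions and levels of alternatives for the preference order $P^2$.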
For the preference order $P^2$ given above one has

$$\pi_2^{P(1)} = \pi_3^{P(1)} = (1-1)\Lambda_1 = 0, \quad \pi_4^{P(3)} = (3-1)\Lambda_3 = 2, \quad \pi_1^{P(4)} = \pi_5^{P(4)} = (4-1)\Lambda_4 = 3,$$

hence $\Lambda = [1, 0, 1, 1, 0]$, $\pi^2 = [3, 0, 0, 2, 3]$.
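
The levels, doubled positions and level indicators used in this example can be derived from the preference order itself. The sketch below assumes that a preference order is stored as a list of tie groups (best first); the function names are illustrative and the level-indicator vector is written `lam`.

```python
def structures(order):
    """For each tie group return (level, T).

    The level is the classical position of the first alternative of the group;
    T = 2 * level + size - 1 is the doubled fractional position of the group.
    """
    result = []
    level = 1
    for group in order:
        result.append((level, 2 * level + len(group) - 1))
        level += len(group)
    return result


def level_indicators(order, n):
    """Entry l-1 equals 1 if a structure from level l appears in the order, else 0."""
    lam = [0] * n
    for level, _ in structures(order):
        lam[level - 1] = 1
    return lam


def preference_vector_from_levels(order, n):
    """Components (level - 1) * indicator, assigned to every alternative of the group."""
    lam = level_indicators(order, n)
    pi = [0] * n
    for group, (level, _) in zip(order, structures(order)):
        for alt in group:
            pi[alt - 1] = (level - 1) * lam[level - 1]
    return pi


# Preference order P^2 = {(O2, O3), O4, (O1, O5)}.
P2 = [[2, 3], [4], [1, 5]]
```

For `P2` the structures sit at (level, T) = (1, 3), (3, 6) and (4, 9), i.e. at t = 1½, 3 and 4½, matching the cells occupied in Table 7.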

The coefficients of the distance matrix $\bar{H}$ are of the form:

$$\bar{h}_{ik}^{(\ell)} = |\pi_i^{P(\ell)} - \pi_i^k|, \quad i=1,\ldots,n;\ k=1,\ldots,K;\ \ell=1,\ldots,n. \tag{62}$$

Summing the coefficients (62) over k (k=1,...,K) one gets

$$\bar{h}_i^{(\ell)} = \sum_{k=1}^{K} \bar{h}_{ik}^{(\ell)}. \tag{63}$$

It should be emphasized that the components of preference vectors result from the table of
structures, i.e. they depend on the position as well as on the level at which a given alternative is located.






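As the components of the preference vectors $\pi^k$, k=1, ..., K are the same regardless of the notation
of positions applied, it can be seen that $\bar{H} = H$. The bar over H only indicates another way of
defining the components of the preference vector. One has

$$\bar{H} = \begin{array}{c|cccc}
 & \ell=1 & \ell=2 & \cdots & \ell=n \\ \hline
O_1 & \bar{h}_1^{(1)} & \bar{h}_1^{(2)} & \cdots & \bar{h}_1^{(n)} \\
O_2 & \bar{h}_2^{(1)} & \bar{h}_2^{(2)} & \cdots & \bar{h}_2^{(n)} \\
\vdots & \vdots & \vdots & & \vdots \\
O_n & \bar{h}_n^{(1)} & \bar{h}_n^{(2)} & \cdots & \bar{h}_n^{(n)}
\end{array} \tag{64}$$

The optimization problem for the Litvak median is as follows:

$$\min_{y_{iT}^{\ell}} \sum_{i=1}^{n} \sum_{T=2}^{2n} \sum_{\ell=\underline{\ell}_T}^{\overline{\ell}_T} \bar{h}_i^{(\ell)}\, y_{iT}^{\ell} \tag{65}$$

subject to (15)-(20).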



5.2.3 The Kemeny median 

If ties can occur in the group judgement, the optimization problem to be solved is more
difficult. As mentioned before, the Kemeny median is defined with the use of
pairwise comparisons. To solve the problem of determining the group judgement one has to
introduce some additional constraints.

The problem of finding a preference order P such that the distance (52) is minimized can be
formulated as follows:



where 

xu is given by (56), 



/ 11 11 II — L 11 1 

m l n ZEvii+EE^zi 



(66) 



$$z_{il} = \begin{cases} 1 & \text{for } O_i \approx O_l \text{ in the preference order } P \\ 0 & \text{otherwise} \end{cases} \tag{67}$$



m 



(68) 



$r_{il}$ is given by (51),

subject to a specially defined set of constraints. This set can be divided into two groups. The first
one contains the constraints resulting from the table of structures. The second one consists of
the constraints resulting from the definition of the Kemeny median.

The following binary variables are introduced:

$$x_{il}^{\ell} = \begin{cases} 1 & \text{if } O_i \text{ located on the level } \ell \text{ is such that } O_i \succ O_l \text{ in the preference order } P, \\ & \text{for } O_l \text{ from an arbitrary level } \lambda > \ell \\ 0 & \text{otherwise} \end{cases} \tag{69}$$

$$z_{il}^{\ell} = \begin{cases} 1 & \text{if } O_i \approx O_l \text{ on the level } \ell \text{ in the preference order } P \\ 0 & \text{otherwise} \end{cases}$$



The constraints are as follows 

for £ =1, . . ., n-1, i = 1, . . ., n, 1 = 1, . . ., n: 



IX 



n-1 
(=1 



It should be noticed that 



1 iff 



SYrr = l 



and 



zj = l iff 



preceding 
alternative 



Iy< T 



Z Irfr = i 



preceded 
alternative 



Zyf T 



(70) 



(71) 
(72) 



(73) 



(74) 



For the case of binary minimization problem (66) with positive coefficients of the objective 
function, in order to satisfy this condition the following inequalities should hold true 



x£*2X+I Syit-i 



(75) 



preceding preceded 

alternative alternative 



and 



4* ZlyWrr)-!. 



(76) 



Hence the optimization problem becomes:

solve (66) subject to the constraints (67)-(76) and (15)-(20).



6. Numerical examples 

6.1 The Cook-Seiford median 

Example 4. 

Given the set of five alternatives and eleven experts' judgements.

There are no ties in experts' judgements.

$P^1 = \{O_4, O_1, O_3, O_2, O_5\}$

$P^2 = \{O_3, O_1, O_5, O_2, O_4\}$

$P^3 = \{O_3, O_1, O_5, O_2, O_4\}$

$P^4 = \{O_2, O_1, O_5, O_4, O_3\}$

$P^5 = \{O_1, O_5, O_4, O_3, O_2\}$

$P^6 = \{O_3, O_5, O_2, O_4, O_1\}$

$P^7 = \{O_4, O_1, O_3, O_5, O_2\}$

$P^8 = \{O_1, O_3, O_5, O_2, O_4\}$

$P^9 = \{O_5, O_1, O_2, O_4, O_3\}$

$P^{10} = \{O_1, O_2, O_3, O_5, O_4\}$

$P^{11} = \{O_1, O_5, O_2, O_3, O_4\}$

The lower bound C (29) of the distance is equal to 51.

If ties are not allowed in the group judgement, the solutions obtained are

$\{O_1, O_3, O_5, O_2, O_4\}$

$\{O_1, O_5, O_3, O_2, O_4\}$
and the distance (30) from the set of preference orders given by the experts is equal to 58.
The group judgement with ties is $\{O_1, (O_2, O_3, O_5), O_4\}$.

The distance (60) from the set of preference orders given by the experts is equal to 56.
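
The figures of Example 4 can be checked with a short script. The sketch below replaces the integer programming solver by exhaustive search over the 5! assignments, which is feasible only for small n; the variable names are illustrative. The distance of the tied judgement is evaluated here in the units of the fractional positions t, which is the scale on which the reported value 56 is reproduced.

```python
from itertools import permutations

# Expert judgements of Example 4, best alternative first (alternatives numbered 1..5).
EXPERTS = [
    [4, 1, 3, 2, 5], [3, 1, 5, 2, 4], [3, 1, 5, 2, 4], [2, 1, 5, 4, 3],
    [1, 5, 4, 3, 2], [3, 5, 2, 4, 1], [4, 1, 3, 5, 2], [1, 3, 5, 2, 4],
    [5, 1, 2, 4, 3], [1, 2, 3, 5, 4], [1, 5, 2, 3, 4],
]
N = 5

# q[k][i] is the position of alternative i+1 in the k-th expert's order.
q = [[order.index(alt) + 1 for alt in range(1, N + 1)] for order in EXPERTS]


def cost(i, t):
    """Aggregated distance of alternative i+1 when it is placed at position t."""
    return sum(abs(qk[i] - t) for qk in q)


# Distance matrix: rows are alternatives, columns are positions j = 1..n.
D = [[cost(i, j) for j in range(1, N + 1)] for i in range(N)]

# Lower bound: every alternative takes its individually best position.
lower_bound = sum(min(row) for row in D)

# Exhaustive search; perm[j] is the alternative placed at position j+1.
totals = {
    perm: sum(D[alt - 1][j] for j, alt in enumerate(perm))
    for perm in permutations(range(1, N + 1))
}
best = min(totals.values())
medians = sorted(perm for perm, value in totals.items() if value == best)

# Tied judgement {O1, (O2, O3, O5), O4}: fractional positions 1, 3, 3, 5, 3 for O1..O5.
tied_positions = {1: 1, 2: 3, 3: 3, 4: 5, 5: 3}
tied_distance = sum(cost(alt - 1, t) for alt, t in tied_positions.items())
```

The search returns exactly the two orders listed in the example, and the tied judgement lies closer to the experts' judgements than either of them.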
Example 5. 

Given the set of five alternatives and eleven experts' judgements.
Ties are allowed in experts' judgements.

$P^1 = \{O_3, (O_4, O_5), (O_1, O_2)\}$

$P^2 = \{(O_1, O_2, O_3), (O_4, O_5)\}$

$P^3 = \{O_3, O_2, (O_1, O_4, O_5)\}$

$P^4 = \{O_2, (O_1, O_5), (O_3, O_4)\}$

$P^5 = \{(O_1, O_4), O_3, O_5, O_2\}$

$P^6 = \{(O_2, O_4), (O_1, O_5), O_3\}$

$P^7 = \{(O_1, O_2, O_3, O_5), O_4\}$

$P^8 = \{O_5, (O_1, O_3), O_4, O_2\}$

$P^9 = \{O_2, (O_1, O_3, O_4), O_5\}$

$P^{10} = \{O_1, O_2, (O_3, O_4, O_5)\}$

$P^{11} = \{O_1, (O_2, O_3), (O_4, O_5)\}$

The lower bound C (29) of the distance is equal to 52.

If ties are allowed in experts' judgements only, then the solutions obtained are as follows

$\{O_1, O_2, O_3, O_5, O_4\}$

$\{O_2, O_1, O_3, O_5, O_4\}$

and the distance (30) from the preference orders given by the experts is equal to 66.

The group judgement with ties is $\{(O_1, O_2, O_3), (O_4, O_5)\}$.

The distance (60) from the set of preference orders given by the experts is equal to 60.
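
The values quoted in Example 5 can be recomputed from the doubled positions T = 2t. The sketch below builds the coefficients of (59) and evaluates the objective (60) for a given group judgement; the representation of orders as lists of tie groups and the function names are assumptions made for this illustration. The results are halved to return to the scale of the positions t, which is the scale on which the reported values 52, 66 and 60 are reproduced.

```python
# Expert judgements of Example 5 as lists of tie groups, best group first.
EXPERTS = [
    [[3], [4, 5], [1, 2]],
    [[1, 2, 3], [4, 5]],
    [[3], [2], [1, 4, 5]],
    [[2], [1, 5], [3, 4]],
    [[1, 4], [3], [5], [2]],
    [[2, 4], [1, 5], [3]],
    [[1, 2, 3, 5], [4]],
    [[5], [1, 3], [4], [2]],
    [[2], [1, 3, 4], [5]],
    [[1], [2], [3, 4, 5]],
    [[1], [2, 3], [4, 5]],
]
N = 5


def doubled_positions(order, n):
    """Doubled fractional positions 2t: a group of r alternatives starting at p gets 2p + r - 1."""
    pos = [0] * n
    p = 1
    for group in order:
        for alt in group:
            pos[alt - 1] = 2 * p + len(group) - 1
        p += len(group)
    return pos


Q2 = [doubled_positions(order, N) for order in EXPERTS]

# d[i][T] = sum over experts of |2 q_i^k - T| for T = 2..2n, as in (59).
d = [{T: sum(abs(q[i] - T) for q in Q2) for T in range(2, 2 * N + 1)} for i in range(N)]


def distance(order):
    """Objective (60) for a group judgement, halved to return to the units of t."""
    pos = doubled_positions(order, N)
    return sum(d[i][pos[i]] for i in range(N)) / 2


# Lower bound: every alternative takes its individually best doubled position.
lower_bound = sum(min(row.values()) for row in d) / 2
```

Only evaluation is shown here; finding the optimal judgement with ties additionally requires the structure constraints (15)-(20).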
6.2 The Litvak median

Example 6. 

Given the set of six alternatives and eleven experts' judgements. 

There are no ties in experts' judgements. 

P1 = {O2, O1, O3, O5, O6, O4}

P2 = {O1, O3, O2, O5, O6, O4}

P3 = {O4, O5, O6, O3, O1, O2}

P4 = {O5, O1, O4, O6, O2, O3}

P5 = {O5, O4, O2, O6, O1, O3}

P6 = {O2, O5, O4, O6, O3, O1}

P7 = {O1, O5, O2, O4, O3, O6}

P8 = {O6, O1, O2, O5, O3, O4}

P9 = {O2, O5, O6, O4, O3, O1}

P10 = {O4, O2, O6, O1, O3, O5}

P11 = {O4, O5, O6, O1, O2, O3}

The lower bound G (40) of the distance is equal to 82. 

If ties are not allowed in group judgement the solutions obtained are 

{O4, O5, O2, O6, O1, O3}

{O2, O5, O4, O6, O1, O3}

The distance (43) from the set of preference orders given by experts is equal to 98.

The group judgement with ties is {(O2, O4, O5), O6, O1, O3}.

The distance (65) from the set of preference orders given by experts is equal to 96. 
Example 7. 

Given the set of six alternatives and eleven experts' judgements. 
Ties are allowed in experts' judgements. 

P1 = {O1, O6, O4, (O2, O3), O5}

P 2= {(Oi, O2), O4, Og, (O3, Os)} 

P3 = {O4, (O1, O2), O5, (O3, O6)}

P4 = {(O2, O3, O6), O4, O1, O5}

P5 = {(O3, O5), (O1, O2, O6), O4}

P 6 = {(Oj, O4, 5 ), (O3, Ofi), Oi} 

P7 = {O3, (O4, O5, O6), (O1, O2)}

P8 = {O1, (O3, O6), (O2, O4, O5)}

P9 = {(O1, O5), O3, (O2, O6), O4}

p io = {(o 3 , Os), 02,04,05,04 

P11 = {O1, O2, (O3, O4, O5), O6}

The lower bound G (42) of the distance is equal to 95. 

The solutions obtained for the case of no ties in group judgement are 

{O1, O3, O2, O6, O4, O5}, {O1, O3, O2, O6, O5, O4}

{O1, O3, O2, O4, O6, O5}, {O1, O2, O3, O6, O4, O5}

{O1, O2, O3, O6, O5, O4}, {O1, O2, O3, O4, O6, O5}

The distance (43) from the set of preference orders given by experts is equal to 116.

The group judgement with ties is {(O1, O2, O3, O4, O5, O6)}.

The distance (65) from the set of preference orders given by experts is equal to 97.

It is worth noting that in this example making all the alternatives equivalent significantly
improved the distance.

6.3 The Kemeny median 

Example 8. 

Given the set of seven alternatives and eleven experts' judgements. 

There are no ties in experts' judgements. 

P 1 = {Oi, Oi o 6 , o 7 , o 5 , o 3 , o 4 } 

P2 = {O2, O6, O4, O5, O1, O7, O3}

P 3 = {0 3 , Oi 6 , Oi, O z , O4, o 5 } 

P4 = {O4, O1, O6, O5, O3, O2, O7}

P5 = {O4, O6, O1, O3, O7, O5, O2}

P 6 = {0 4 , Oi o 7 , 6 , Oi, o 3 , o 5 } 

P 7 = {o 4 , o 7 , o 6 , o^ O3, Oi o 5 } 

P8 = {O6, O3, O7, O1, O2, O4, O5}

P9 = {O2, O5, O7, O1, O3, O4, O6}

P 10 = {0 7 , 6 , Oi O5, 3/ O4, Oj} 

P 11 = {0 6 , O3, Oj, Oi 7/ O4, 5 | 

The lower bound E (54) of the distance is equal to 166. 

If ties are not allowed in group judgement the solutions are 

{O b , Oi, Oi 7 , O3, O4, O5}, {0 6 , Oi, O3, Oi 7 , O4, O5} 

and the distance (55) from the set of preference orders given by experts is equal to 170. 
The group judgement derived for the problem (66)-(76) (with ties assumed) is
{O6, O1, O2, O7, O3, O4, O5}. It is worth noting that it does not contain tied alternatives even
though ties were allowed.

The distance (66) from the set of preference orders given by experts is equal to 170. 
Example 9. 

Given the set of seven alternatives and eleven experts' judgements. 
Ties are allowed in experts' judgements. 

P 1 = {Or, Oi O4, 7 , (0 3 , O s , 6 )} 

P 2 = {(O4, 6 ), O7, 5/ (Oi 3 ), Oi} 

P3 = {O4, O7, (O1, O2, O6), (O3, O5)}

P 4 = {Oi O3, (0 4/ O5, 6 ), 7 , Ox} 

P5 = {(O3, O5, O6, O7), (O1, O4), O2}

P6 = {O4, O6, (O1, O2), (O3, O5), O7}

P 7 = {d, (O3, o 4 , 5 , O7), (Oi 6 )} 

P8 = {O4, (O3, O6), O7, (O1, O2), O5}

P 9 = {o 5 , o 6 , O4, Oi o 3 , (O^ 7 )} 

P10 = {O5, O1, O3, (O2, O4, O6), O7}

P11 = {(O1, O2), (O3, O4), (O5, O6), O7}

The lower bound E (52) of the distance is equal to 187. 

If ties are not allowed in group judgement the solutions obtained are 

{O4, O6, O1, O2, O3, O5, O7}, {O4, O6, O1, O2, O5, O3, O7}

and the distance (55) from the set of preference orders given by experts is equal to 187. 
The group judgement with ties is {O4, (O1, O2, O3, O5, O6), O7}
and the distance (66) from the set of preference orders given by experts is equal to 177. 

7. Concluding remarks 

In the paper it is shown that the fractional notation and the structure table can be 
successfully applied to find the group judgement with ties for the distance defined as the 
Cook-Seiford, Litvak and Kemeny median. On the basis of results obtained it seems 
reasonable to make an attempt at applying the approach discussed to the other methods of 
determining group judgement with ties or in the case when some alternatives are not 
comparable. 

1. Introduction 

The modeling and simulation of robot dynamics has been studied intensively over the last
three decades.

In particular, the forward dynamics problem of a robot remains a very relevant issue: there is
still much to say about computation algorithms that are efficient and also simple to
understand, develop and implement, above all for practical robots, i.e. robots with many
links and/or with flexible links (Featherstone, 1987), (Featherstone & Orin, 2000), (Sciavicco
& Siciliano, 2000), (Khalil & Dombre, 2002).

Indeed, in these cases the methods based on the classical Lagrange formulation give rise to
an analytical model with numerous terms that may be difficult to use. The methods based
on the Newton-Euler formulation are not very easy to apply and do not provide easily
manageable analytical formulae, even if they are efficient from a computational point of
view (Featherstone, 1987).

An important contribution to solving these problems is given in (Celentano, 2006),
(Celentano & Iervolino, 2006), (Celentano & Iervolino, 2007) by a new, simple and efficient
methodology of analysis, valid for all robots, which makes use of a mathematical model
containing a lower number of algebraic terms and which allows computing, with a prescribed
maximum error, the gradient of the kinetic energy starting from the numerical knowledge of
the inertia matrix only, rather than using, as is usual in the literature, complex
analytical calculations of the closed-form expression of this matrix.

This result is significant because it allows solving the forward dynamics problem of a robot
in a simple and efficient manner, by analytically or numerically computing only the inertia
matrix and the potential energy gradient.

Moreover, this method allows students, researchers and professionals with no particular
knowledge of mechanics to easily model planar and spatial robots with practical links.

From this methodology also follows a simple and efficient algorithm for modeling flexible
robots, which divides the links into rigid sublinks interconnected by equivalent elastic joints
and approximates and/or neglects some terms related to the deformation variables.

In this chapter some of the main results stated in (Celentano, 2006), (Celentano & Iervolino,
2006), (Celentano & Iervolino, 2007) are reported.

In detail, Section 2 reports the new integration scheme for robot modeling, based on the
knowledge of the inertia matrix and of the potential energy only (Celentano &
Iervolino, 2006).

In Section 3, for planar robots with revolute joints, theorems are stated and proved that
provide a simple and efficient method of expressing both the inertia matrix and the gradient
of the kinetic energy in a closed and elegant analytical form. Moreover, the efficiency of the
proposed method is compared to the efficiency of the Articulated-Body method, considered
one of the most efficient Newtonian methods in the literature (Celentano & Iervolino, 2006).

In Section 4, for spatial robots with links of generic shape connected, for the sake of
brevity, by spherical joints, several theorems are formulated and proved in a simple manner,
and some algorithms are provided that allow efficiently computing the inertia matrix
analytically and the gradients of the kinetic and of the gravitational energy analytically or
numerically. Also in this case the proposed method is compared, in terms of efficiency, with
the Articulated-Body one (Celentano & Iervolino, 2007).

In Section 5 some elements of flexible robot modeling are provided, which allow obtaining
quite simply finite-dimensional models that are accurate and computationally efficient.
Moreover, a significant example of implementation of the proposed results is
presented (Celentano, 2007).

Finally, in Section 6 some conclusions and future developments are reported.

2. A new formulation of the Euler-Lagrange equation 

In the following, a new formulation of the dynamic model of a robot mechanism, in a form more
efficient for its analytical and/or numerical study, is presented.

It is well known that the usual form of the Euler-Lagrange equation for a generic robot with
n degrees of freedom (De Wit et al., 1997),

$$\frac{d}{dt}\frac{\partial T}{\partial \dot q} - \frac{\partial T}{\partial q} = \gamma + \eta + u_a + u, \qquad (1)$$

where $q$ is the vector of the Lagrangian coordinates, $T = \frac{1}{2}\dot q^T B(q)\dot q$ is the kinetic
energy, $B$ being the robot inertia matrix, $\gamma$ is the vector of the gravity forces, $\eta$ is
the vector of the elasticity forces, $u_a$ is the vector of the non-conservative generalised forces
(e.g. damping torques), which are usually functions of $q$ and $\dot q$ only, and $u$ is the vector of
the actuation generalised forces, is typically rewritten as

$$B(q)\ddot q + C(q,\dot q)\dot q = \gamma + \eta + u_a + u, \qquad (2)$$

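in which an expression of matrix C is the following:

$$C(q,\dot q) = \sum_{i=1}^{n}\frac{\partial B}{\partial q_i}\dot q_i - \frac{1}{2}\begin{bmatrix}\dot q^T\dfrac{\partial B}{\partial q_1}\\ \vdots \\ \dot q^T\dfrac{\partial B}{\partial q_n}\end{bmatrix}. \qquad (3)$$
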
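By setting $B = \{b_{ij}\}$, $C = \{c_{ij}\}$, it is also

$$c_{ij} = \sum_{k=1}^{n}\frac{\partial b_{ij}}{\partial q_k}\dot q_k - \frac{1}{2}\sum_{k=1}^{n}\frac{\partial b_{jk}}{\partial q_i}\dot q_k. \qquad (4)$$

Alternatively, an equivalent matrix $C_c = \{c^c_{ij}\}$, i.e. such that $C_c\dot q = C\dot q$, that makes use of the
Christoffel symbols (Sciavicco & Siciliano, 2000), is given by

$$c^c_{ij} = \frac{1}{2}\sum_{k=1}^{n}\left(\frac{\partial b_{ij}}{\partial q_k} + \frac{\partial b_{ik}}{\partial q_j} - \frac{\partial b_{jk}}{\partial q_i}\right)\dot q_k. \qquad (5)$$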



There are various methods to calculate both $B(q)$ and $C(q,\dot q)$. The conceptually simplest
one, based on analytical expressions, is extremely complex and onerous, even if symbolic
manipulation languages are employed. Other methods require a more in-depth knowledge
of mechanics (De Wit et al., 1997).

The next theorem provides an innovative, efficient and simple method for modeling and
simulating a robot.

Theorem 1. If the Euler-Lagrange equation is rearranged as follows:

$$\frac{d}{dt}(B\dot q) = c + \gamma + \eta + u_a + u, \qquad (6)$$

where c is the gradient of the kinetic energy with respect to q, namely

$$c = \frac{\partial T}{\partial q}, \qquad (7)$$

then it can be efficiently integrated according to the integration scheme of Fig. 1,
where $q_B = B\dot q$ and F is a suitable function that allows computing $\dot q$ and $\dot q_B$ from $u$, $q$
and $q_B$.

Fig. 1. Block diagram for the robot forward dynamics integration. 

Proof. It is easy to verify that the algorithm that describes the function F of the block scheme
of Fig. 1 is:

Step 1. Compute $B(q)$ by applying one of the classic methods proposed in the literature (see
also Theorems 2 and 3 and (30) in Section 3).

Step 2. Compute $\dot q$ through the relationship $\dot q = B^{-1}q_B$.

Step 3. Compute $c(q,\dot q)$ and $u_a$.

Step 4. Compute $\frac{d}{dt}(B\dot q) = \dot q_B = c + \gamma + \eta + u_a + u$.
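
The four steps can be sketched in Python as follows. This is an illustration under stated assumptions, not the authors' implementation: the state is integrated with explicit Euler, the gradient c is obtained by central differences of the kinetic energy, and `example_inertia` is a hypothetical two-joint inertia matrix chosen only so that the sketch is runnable. All names are illustrative.

```python
import math


def solve(a, b):
    """Solve the linear system a x = b by Gaussian elimination with partial pivoting."""
    n = len(b)
    m = [row[:] + [b[i]] for i, row in enumerate(a)]
    for col in range(n):
        piv = max(range(col, n), key=lambda r: abs(m[r][col]))
        m[col], m[piv] = m[piv], m[col]
        for r in range(col + 1, n):
            f = m[r][col] / m[col][col]
            for k in range(col, n + 1):
                m[r][k] -= f * m[col][k]
    x = [0.0] * n
    for i in reversed(range(n)):
        x[i] = (m[i][n] - sum(m[i][k] * x[k] for k in range(i + 1, n))) / m[i][i]
    return x


def kinetic_energy(inertia, q, qd):
    """T = 1/2 qd^T B(q) qd."""
    b = inertia(q)
    n = len(q)
    return 0.5 * sum(qd[i] * b[i][j] * qd[j] for i in range(n) for j in range(n))


def F(inertia, forces, q, qB, delta=1e-4):
    """Steps 1-4: return (qd, d(qB)/dt) given q and qB = B(q) qd."""
    qd = solve(inertia(q), qB)                  # Steps 1 and 2
    c = []
    for i in range(len(q)):                     # Step 3: c = dT/dq with qd held fixed
        qp, qm = q[:], q[:]
        qp[i] += delta
        qm[i] -= delta
        c.append((kinetic_energy(inertia, qp, qd)
                  - kinetic_energy(inertia, qm, qd)) / (2 * delta))
    f = forces(q, qd)                           # gamma + eta + u_a + u
    return qd, [ci + fi for ci, fi in zip(c, f)]  # Step 4


def simulate(inertia, forces, q, qd, dt, steps):
    """Integrate q and qB with explicit Euler; return the final q and qd."""
    qB = [sum(bij * v for bij, v in zip(row, qd)) for row in inertia(q)]
    for _ in range(steps):
        qd, qB_dot = F(inertia, forces, q, qB)
        q = [qi + dt * v for qi, v in zip(q, qd)]
        qB = [p + dt * pd for p, pd in zip(qB, qB_dot)]
    return q, solve(inertia(q), qB)


def example_inertia(q):
    """Hypothetical two-joint inertia matrix (positive definite for every q)."""
    c = math.cos(q[1])
    return [[3 + 2 * c, 1 + c], [1 + c, 1.0]]


def no_forces(q, qd):
    """No gravity, elasticity, damping or actuation."""
    return [0.0, 0.0]
```

With no applied forces the kinetic energy should stay approximately constant along the simulated motion, which gives a simple sanity check of the scheme.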

Remark 1. The proposed method avoids the computation of the first term on the
right-hand side of (4), or of the first two terms on the right-hand side of (5). This saving is
significant because the computational burden of this term is not negligible with
respect to the one required by the calculation of $B^{-1}q_B$.

This clearly emerges from Fig. 2, where the percentages of flops saved, relative to the number
of flops required to compute $B^{-1}q_B$, are evaluated (in the MATLAB™ environment) with
reference to the coefficients in (4), for a considerable number of robots with random but
realistic link parameters.

Fig. 2. Percentages of saved flops evaluated for robot models with random links parameters. 

Remark 2. Since many efficient algorithms for the numerical computation of the matrix B
are available in the literature, the gradient of the kinetic energy c can be computed
numerically in a very simple and accurate way, instead of using a symbolic expression.

To this end, since the kinetic energy of the majority of robots is a quadratic function
of $\dot q$, whose coefficients are constant with respect to $q$ or depend on $q_i$ through
terms of the type $\sin(aq_i + \varphi)$, the following lemma is useful.

Lemma 1. The derivative of $\sin(aq_i)$ can be numerically computed by a two [three] points
formula:

$$\frac{d}{dq_i}\sin(aq_i) \cong \frac{\sin[a(q_i+\Delta)] - \sin(aq_i)}{\Delta} \qquad \left[\frac{d}{dq_i}\sin(aq_i) \cong \frac{\sin[a(q_i+\Delta)] - \sin[a(q_i-\Delta)]}{2\Delta}\right], \qquad (8)$$

with error:

$$e_2 = -\frac{1}{2}\Delta a^2\sin(a\xi),\quad |e_2| \le a^2\frac{\Delta}{2},\quad \xi\in(q_i,\,q_i+\Delta)$$

$$\left[e_3 = -\frac{1}{6}\Delta^2a^3\cos(a\xi),\quad |e_3| \le a^3\frac{\Delta^2}{6},\quad \xi\in(q_i-\Delta,\,q_i+\Delta)\right]. \qquad (9)$$




Proof. See any numerical computation book. 
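
The two formulas and their error bounds can be tried out with a few lines of Python. The sketch below is illustrative only: the function names are hypothetical, and the errors are measured as approximation minus exact derivative.

```python
import math


def two_point(f, x, delta):
    """Forward (two-point) difference approximation of f'(x)."""
    return (f(x + delta) - f(x)) / delta


def three_point(f, x, delta):
    """Central (three-point) difference approximation of f'(x)."""
    return (f(x + delta) - f(x - delta)) / (2 * delta)


def sine_derivative_errors(a, q, delta):
    """Errors of both formulas for d/dq sin(a q), approximation minus exact value."""
    f = lambda x: math.sin(a * x)
    exact = a * math.cos(a * q)
    return two_point(f, q, delta) - exact, three_point(f, q, delta) - exact
```

For instance, `sine_derivative_errors(3.0, 0.7, 1e-3)` returns two errors whose magnitudes stay below $a^2\Delta/2$ and $a^3\Delta^2/6$ respectively, the three-point error being several orders of magnitude smaller.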

In view of Lemma 1 it is possible to compute, with a prescribed maximum error, the 
gradient of the kinetic energy starting from the numerical knowledge of the inertia matrix 
B rather than using, as is usual in the literature, complex analytical calculations of the 
closed-form expression of B. In detail, for practical precision purposes, a good value of 
dT/dq can be obtained using a two [three] points formula with A = 10 -^10 4 [with 
A = 1CT 2 -^lCT 3 ] and evaluating n-\ [2(n-l)] times the inertia matrix B numerically. 
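
As an illustration of this idea, the following Python sketch estimates $c = \partial T/\partial q$ with the three-point formula, using only numerical evaluations of the inertia matrix. The two-joint matrix `inertia` is a hypothetical example that depends on the second coordinate only, so a single coordinate has to be perturbed; the names and the step value are assumptions of the sketch.

```python
import math


def inertia(q):
    """Hypothetical two-joint inertia matrix that depends on q[1] only."""
    c = math.cos(q[1])
    return [[3 + 2 * c, 1 + c], [1 + c, 1.0]]


def quad(b, v):
    """Quadratic form v^T b v."""
    n = len(v)
    return sum(v[i] * b[i][j] * v[j] for i in range(n) for j in range(n))


def kinetic_gradient(inertia, q, qd, delta=1e-3, skip=(0,)):
    """Three-point estimate of dT/dq; coordinates listed in `skip` do not affect B."""
    c = [0.0] * len(q)
    for i in range(len(q)):
        if i in skip:
            continue
        qp, qm = q[:], q[:]
        qp[i] += delta
        qm[i] -= delta
        # T = 1/2 qd^T B(q) qd, differentiated with qd held fixed
        c[i] = 0.5 * (quad(inertia(qp), qd) - quad(inertia(qm), qd)) / (2 * delta)
    return c
```

For this example the exact value of the second component is $-\sin(q_2)(\dot q_1^2 + \dot q_1\dot q_2)$, which the estimate reproduces to within about $10^{-7}$ for $\Delta = 10^{-3}$ with two evaluations of the inertia matrix.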

3. Planar robot modeling 

In the case of planar robots with revolute joints, theorems can be introduced and 
demonstrated to provide a sufficiently simple and efficient method of expressing both the 
inertia matrix and the gradient of the kinetic energy in a closed and elegant analytical form. 

3.1 Hypotheses and notations 

In the following, the case of a generalised planar robot constituted by n links is considered,
each of them with parallel rotation axes $C_i^-$, $C_i^+$ and center of mass $G_i$ belonging to the
plane containing the two respective rotation axes (see Fig. 3). In Fig. 4 a planar robot with three
links and a horizontal work plane is shown. 




Fig. 3. The generic link of the considered generalised planar robot. 
Fig. 4. The considered generalised planar robot with three links and horizontal work plane. 

Fig. 5. Geometric characteristics of the i-th link. 

Fig. 6. Schematic representation of a planar robot with n links. 

Referring to Figs. 5 and 6, the following notations are also employed: 

$q_i = [x_i\ y_i\ \alpha_i]^T$ = absolute coordinates of the i-th link, 

$I_i$ = inertia matrix of the i-th link in terms of the coordinates $q_i$, 

$M_i$ = total mass of the i-th link, 

$J_i$, $\rho_i$ = inertia moment and radius of the i-th link with respect to the rotation axis $C_i^-$, 

$N_i$ = static moment of the i-th link with respect to the plane containing $C_i^-$ and 
orthogonal to the plane passing through $C_i^-$ and $C_i^+$, 

$L_{Gi}$ = distance of the center of mass $G_i$ of the i-th link from the axis $C_i^-$, 

$\alpha_{1\ldots i} = [\alpha_1\ \alpha_2\ \ldots\ \alpha_i]^T$ = vector of the absolute angular coordinates of the chain constituted by 
the first i links, 

$\beta_{1\ldots i} = [\beta_1\ \beta_2\ \ldots\ \beta_i]^T$ = vector of the relative angular coordinates of the chain constituted by 
the first i links, 

$A_i$ = inertia matrix of the i-th link in terms of the coordinates $\alpha_{1\ldots i}$, 

$B_i$ = inertia matrix of the i-th link in terms of the coordinates $\beta_{1\ldots i}$, 

$\alpha = [\alpha_1\ \alpha_2\ \ldots\ \alpha_n]^T$ = vector of the absolute angular coordinates of the robot, 

$\beta = [\beta_1\ \beta_2\ \ldots\ \beta_n]^T$ = vector of the relative angular coordinates of the robot, 

$A$ = inertia matrix of the robot in terms of the coordinates $\alpha$, 

$B$ = inertia matrix of the robot in terms of the coordinates $\beta$. 

3.2 Main results 

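Consider a generic link of a planar robot with revolute joints. The following theorem can be
stated.

Theorem 2. The kinetic energy $T_i$ of a link of a planar robot, whose rotation axes are
parallel and whose center of mass belongs to the plane that contains the respective rotation
axes (see Figs. 3 and 5), can be calculated by the relationship

$$T_i = \frac{1}{2}\dot q_i^T I_i \dot q_i, \qquad (10)$$

where

$$I_i = \begin{bmatrix} M_i & 0 & -N_i\sin\alpha_i \\ 0 & M_i & N_i\cos\alpha_i \\ -N_i\sin\alpha_i & N_i\cos\alpha_i & J_i \end{bmatrix} = M_i\begin{bmatrix} 1 & 0 & -L_{Gi}\sin\alpha_i \\ 0 & 1 & L_{Gi}\cos\alpha_i \\ -L_{Gi}\sin\alpha_i & L_{Gi}\cos\alpha_i & \rho_i^2 \end{bmatrix}. \qquad (11)$$

Proof. By omitting subscript i, the coordinates of a generic point P of link L are (see Fig. 7)

$$x_P = x + \xi\cos\alpha - \eta\sin\alpha, \qquad y_P = y + \xi\sin\alpha + \eta\cos\alpha, \qquad (12)$$

from which it is easy to prove that the square of the velocity of P is

$$V_P^2 = \dot x_P^2 + \dot y_P^2 = \dot x^2 + \dot y^2 + \dot\alpha^2(\xi^2+\eta^2) - 2\dot x\dot\alpha(\xi\sin\alpha + \eta\cos\alpha) + 2\dot y\dot\alpha(\xi\cos\alpha - \eta\sin\alpha)$$

$$= V^2 + \dot\alpha^2d^2 - 2\dot x\dot\alpha(\xi\sin\alpha + \eta\cos\alpha) + 2\dot y\dot\alpha(\xi\cos\alpha - \eta\sin\alpha),$$

with $V^2 = \dot x^2 + \dot y^2$ and $d^2 = \xi^2 + \eta^2$.
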
Fig. 7. Determination of the coordinates of a generic point P of the link L. 

Since the kinetic energy of link L is $T = \frac{1}{2}\int_L V_P^2\,dm$, and given the hypotheses made about the
characteristics of the link, the proof of the theorem easily follows. 

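Consider a planar robot with n links (see Fig. 6). The following theorem holds:

Theorem 3. The kinetic energy of the i-th link of a generic planar robot can be calculated by
the relationship

$$T_i = \frac{1}{2}\dot\alpha_{1\ldots i}^T A_i\,\dot\alpha_{1\ldots i}, \qquad (14)$$

where the inertia matrix $A_i$, expressed in terms of the angular coordinates $\alpha_{1\ldots i}$,
can be calculated by using the formula:

$$A_i = R_i^T I_i R_i, \qquad (15)$$

in which

$$R_i = \begin{bmatrix} -L_1\sin\alpha_1 & -L_2\sin\alpha_2 & \cdots & -L_{i-1}\sin\alpha_{i-1} & 0 \\ L_1\cos\alpha_1 & L_2\cos\alpha_2 & \cdots & L_{i-1}\cos\alpha_{i-1} & 0 \\ 0 & 0 & \cdots & 0 & 1 \end{bmatrix}, \qquad (16)$$

or more directly, with $A_i = \{a_{i,hk}\}$, by the formulae:

$$a_{i,hh} = \begin{cases} M_iL_h^2, & \text{if } h<i \\ J_i = M_i\rho_i^2, & \text{if } h=i, \end{cases} \qquad (17a)$$

$$a_{i,hk} = a_{i,kh} = \begin{cases} M_iL_hL_k\cos(\alpha_k-\alpha_h), & \text{if } h<k<i \\ M_iL_hL_{Gi}\cos(\alpha_i-\alpha_h), & \text{if } h<k=i. \end{cases} \qquad (17b)$$

An alternative, equivalent and more mnemonic expression for (17a)-(17b) is

1) $\forall h \le i$,

$$a_{i,hh} = M_iL_h^2, \quad \text{where } L_h \text{ is replaced by } \rho_i \text{ if } h = i; \qquad (18a)$$

2) $\forall h<k\le i$,

$$a_{i,hk} = M_iL_hL_k\cos(\alpha_k-\alpha_h), \quad \text{where } L_k \text{ is replaced by } L_{Gi} \text{ if } k = i. \qquad (18b)$$
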

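Proof. For the generic i-th link, in the hypothesis that $x_1 = y_1 = 0$, it is

$$x_i = L_1\cos\alpha_1 + L_2\cos\alpha_2 + \ldots + L_{i-1}\cos\alpha_{i-1}$$
$$y_i = L_1\sin\alpha_1 + L_2\sin\alpha_2 + \ldots + L_{i-1}\sin\alpha_{i-1} \qquad (19)$$
$$\alpha_i = \alpha_i,$$

from which

$$\dot x_i = -\dot\alpha_1L_1\sin\alpha_1 - \dot\alpha_2L_2\sin\alpha_2 - \ldots - \dot\alpha_{i-1}L_{i-1}\sin\alpha_{i-1}$$
$$\dot y_i = \dot\alpha_1L_1\cos\alpha_1 + \dot\alpha_2L_2\cos\alpha_2 + \ldots + \dot\alpha_{i-1}L_{i-1}\cos\alpha_{i-1} \qquad (20)$$
$$\dot\alpha_i = \dot\alpha_i,$$

i.e.

$$\dot q_i = R_i\,\dot\alpha_{1\ldots i}. \qquad (21)$$

It follows that

$$T_i = \frac{1}{2}\dot q_i^T I_i\dot q_i = \frac{1}{2}\dot\alpha_{1\ldots i}^T R_i^T I_i R_i\,\dot\alpha_{1\ldots i}, \qquad (22)$$

and hence (15).
From (15) it is:

1) $\forall h \le k<i$

$$a_{i,hk} = [-L_h\sin\alpha_h\ \ L_h\cos\alpha_h\ \ 0]\; I_i \begin{bmatrix} -L_k\sin\alpha_k \\ L_k\cos\alpha_k \\ 0 \end{bmatrix} = M_iL_hL_k\cos(\alpha_k-\alpha_h); \qquad (23)$$

2) $\forall h<k=i$

$$a_{i,hi} = [-L_h\sin\alpha_h\ \ L_h\cos\alpha_h\ \ 0]\; I_i \begin{bmatrix} 0 \\ 0 \\ 1 \end{bmatrix} = L_hN_i\cos(\alpha_i-\alpha_h); \qquad (24)$$

3) $h = k = i$

$$a_{i,ii} = [0\ \ 0\ \ 1]\; I_i \begin{bmatrix} 0 \\ 0 \\ 1 \end{bmatrix} = J_i, \qquad (25)$$

where $I_i$ is given by (11), and hence (17a), (17b).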
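
The equivalence between the product form (15)-(16) and the direct formulae (17a)-(17b) can be checked numerically. The Python sketch below is only an illustration: the function names and the sample data are hypothetical, links are numbered from 1 in the text and stored in 0-based lists in the code, and $N_i = M_iL_{Gi}$, $J_i = M_i\rho_i^2$ as in (11).

```python
import math


def link_inertia(M, LG, rho, alpha):
    """I_i of (11) for mass M, center-of-mass distance LG and inertia radius rho."""
    s, c = math.sin(alpha), math.cos(alpha)
    return [[M, 0.0, -M * LG * s],
            [0.0, M, M * LG * c],
            [-M * LG * s, M * LG * c, M * rho ** 2]]


def A_product(i, M, LG, rho, L, alpha):
    """A_i = R_i^T I_i R_i, formulas (15)-(16); i is the 1-based link number."""
    R = [[-L[h] * math.sin(alpha[h]) for h in range(i - 1)] + [0.0],
         [L[h] * math.cos(alpha[h]) for h in range(i - 1)] + [0.0],
         [0.0] * (i - 1) + [1.0]]
    I = link_inertia(M, LG, rho, alpha[i - 1])
    IR = [[sum(I[r][m] * R[m][k] for m in range(3)) for k in range(i)]
          for r in range(3)]
    return [[sum(R[m][h] * IR[m][k] for m in range(3)) for k in range(i)]
            for h in range(i)]


def A_direct(i, M, LG, rho, L, alpha):
    """A_i from (17a)-(17b): L_h for h < i, rho (diagonal) or LG for index i."""
    def length(m, diagonal):
        if m < i - 1:
            return L[m]
        return rho if diagonal else LG

    A = [[0.0] * i for _ in range(i)]
    for h in range(i):
        for k in range(i):
            if h == k:
                A[h][k] = M * length(h, True) ** 2
            else:
                A[h][k] = (M * length(h, False) * length(k, False)
                           * math.cos(alpha[k] - alpha[h]))
    return A
```

Calling both functions with the same data, for example `i = 3`, `L = [1.0, 0.8, 0.6]` and `alpha = [0.3, 1.1, -0.4]`, should give matrices that agree up to rounding errors.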

In view of Theorem 3, the kinetic energy of the robot is

$$T = T_1 + T_2 + \ldots + T_n = \frac{1}{2}\dot\alpha^T A\,\dot\alpha, \qquad (26)$$

where

$$\alpha = \alpha_{1\ldots n} = [\alpha_1\ \alpha_2\ \ldots\ \alpha_n]^T, \qquad (27)$$

and the inertia matrix $A = A(\alpha)$ of the robot in terms of the angular coordinates $\alpha$ is
obtained by assembling the matrices $A_i$ as illustrated in Fig. 8 (namely, by adding to each
leading principal submatrix of $A_n$ of order $i<n$ the respective $A_i$).

Fig. 8. Graphical representation of the algorithm that computes the robot inertia matrix. 
In MATLAB-like instructions the algorithm to calculate the inertia matrix is: 

A=An; 
for i=n-1:-1:1 
    A(1:i,1:i)=A(1:i,1:i)+Ai;    % Ai denotes the i-by-i matrix A_i 
end 

Moreover, since $\alpha = U\beta$, where

$$U = \begin{bmatrix} 1 & & & \\ 1 & 1 & & \\ \vdots & \vdots & \ddots & \\ 1 & 1 & \cdots & 1 \end{bmatrix}, \qquad (28)$$

the kinetic energy of the robot can alternatively be calculated by using the relationship

$$T = \frac{1}{2}\dot\beta^T B\,\dot\beta, \qquad (29)$$

where the inertia matrix $B = B(\beta)$ of the robot in terms of the angular coordinates $\beta$ is
obtained by

$$B = U^TA(\alpha)U = U^TA(U\beta)U. \qquad (30)$$

Note that the computation of the inertia matrix is performed by jointly using the absolute 
and relative coordinates. This allows simplifying, in a natural manner, the algebraic 
expression of the analytical model. 

Clearly, if for a given $\beta$, and hence for a given $\alpha = U\beta$, the interest is only in the
numerical value of B, it is possible to numerically compute the matrix A first (from
$\alpha$, by using Theorem 3 and the algorithm illustrated in Fig. 8) and then the matrix B
through the relationship $B = U^TAU$ or through the following algorithm described in
MATLAB-like instructions: 

for i=n-1:-1:1 
    A(:,i)=A(:,i)+A(:,i+1); 
end 
for i=n-1:-1:1 
    A(1:i,i+1)=A(i+1,1:i)'; 
    A(i,1:i)=A(i,1:i)+A(i+1,1:i); 
end 
B=A; 
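
A Python transcription of this in-place algorithm, checked against the explicit product $U^TAU$, is sketched below. The function names are illustrative; the translation only assumes that A is symmetric and that U is the lower-triangular matrix of ones of (28).

```python
def b_from_a(a):
    """Return U^T A U for symmetric A, with U the lower-triangular matrix of ones."""
    n = len(a)
    a = [row[:] for row in a]              # work on a copy
    for i in range(n - 2, -1, -1):         # A(:,i) = A(:,i) + A(:,i+1)
        for r in range(n):
            a[r][i] += a[r][i + 1]
    for i in range(n - 2, -1, -1):
        for r in range(i + 1):             # A(1:i,i+1) = A(i+1,1:i)'
            a[r][i + 1] = a[i + 1][r]
        for c in range(i + 1):             # A(i,1:i) = A(i,1:i) + A(i+1,1:i)
            a[i][c] += a[i + 1][c]
    return a


def b_explicit(a):
    """Reference computation of U^T A U by plain matrix products."""
    n = len(a)
    u = [[1.0 if c <= r else 0.0 for c in range(n)] for r in range(n)]
    au = [[sum(a[r][m] * u[m][c] for m in range(n)) for c in range(n)]
          for r in range(n)]
    return [[sum(u[m][r] * au[m][c] for m in range(n)) for c in range(n)]
            for r in range(n)]
```

The first loop accumulates the columns from the right (the product AU); the second one accumulates the rows of the lower triangle and fills the upper triangle by symmetry, so only about half of the entries are summed explicitly.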
Remark 3. It is important to note that, although by virtue of Theorem 3 the analytical 
expression of $A_i$ is relatively simple, the analytical expression of A is quite complex and, as a 
result, the expression of B is extremely complex (e.g. the entry B(1,1) of B is the sum of all 
the elements of all the matrices $A_i$!). 

Once the inertia matrix B is obtained, the dynamic model of the robot can be derived by the 
Euler-Lagrange equation in the usual form (1) or in the alternative form (6). In the case of a 
planar robot, with link characteristics as required by Theorem 2, the efficiency of the method 
illustrated in Theorem 1 can be further improved by using the following result. 
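Theorem 4. The gradient of the kinetic energy of the i-th link, expressed in terms of $\alpha_{1\ldots i}$ [$\beta_{1\ldots i}$], can be
calculated by the relationship

$$c_i = \frac{\partial T_i}{\partial\beta_{1\ldots i}} = -M_i\begin{bmatrix} 0 \\ \sum\limits_{h<k:\,\mathrm{var}\,\beta_2} L_h\dot\alpha_hL_k\dot\alpha_k\sin(\alpha_k-\alpha_h) \\ \vdots \\ \sum\limits_{h<k:\,\mathrm{var}\,\beta_i} L_h\dot\alpha_hL_k\dot\alpha_k\sin(\alpha_k-\alpha_h) \end{bmatrix}$$

$$\left[\ = -M_i\begin{bmatrix} 0 \\ \sum\limits_{h<k:\,\mathrm{var}\,\beta_2} L_h\sum\limits_{p=1}^{h}\dot\beta_p\,L_k\sum\limits_{q=1}^{k}\dot\beta_q\,\sin\Big(\sum\limits_{r=h+1}^{k}\beta_r\Big) \\ \vdots \\ \sum\limits_{h<k:\,\mathrm{var}\,\beta_i} L_h\sum\limits_{p=1}^{h}\dot\beta_p\,L_k\sum\limits_{q=1}^{k}\dot\beta_q\,\sin\Big(\sum\limits_{r=h+1}^{k}\beta_r\Big) \end{bmatrix}\ \right], \qquad (31a)$$

where

$$L_m = \begin{cases} L_m, & \text{if } m<i \\ L_{Gi}, & \text{if } m=i \end{cases}, \qquad m = \{h,k\}, \qquad (31b)$$

and the notation $h<k:\,\mathrm{var}\,\beta_l$, $l = 2,\ldots,i$, indicates the set of the pairs h, k, with $h<k\le i$, such
that the angle $\alpha_{hk} = \alpha_k - \alpha_h = \beta_{h+1} + \beta_{h+2} + \ldots + \beta_k$ between the links h and k varies with $\beta_l$.

Proof. In view of Theorem 3, the kinetic energy of the generic i-th link is

$$T_i = \frac{1}{2}M_i\sum_{h=1}^{i-1}L_h^2\dot\alpha_h^2 + \frac{1}{2}M_i\rho_i^2\dot\alpha_i^2 + M_i\sum_{h=1}^{i-1}\sum_{k=h+1}^{i}L_hL_k\dot\alpha_h\dot\alpha_k\cos(\alpha_k-\alpha_h) = T_c + M_i\sum_{h=1}^{i-1}\sum_{k=h+1}^{i}L_hL_k\dot\alpha_h\dot\alpha_k\cos(\alpha_k-\alpha_h), \qquad (32)$$

where $T_c$ is the portion of kinetic energy that is independent of $\alpha_{1\ldots i}$ and the lengths
$L_h$, $L_k$ in the double sum are given by (31b).

From (32) the expression (31a) of the gradient of $T_i$ in terms of $\alpha_{1\ldots i}$ [$\beta_{1\ldots i}$] easily follows.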

The vector c can be simply obtained by suitably assembling the various vectors $c_i$ (i.e. by
progressively adding to the first i components of $c_n$, $i<n$, the respective $c_i$).

In MATLAB-like instructions: 

c=cn; 
for i=n-1:-1:2 
    c(1:i)=c(1:i)+ci;    % ci denotes the vector c_i 
end 
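
The gradient of the kinetic energy of a single link can be cross-checked numerically. The Python sketch below assumes the expression of $T_i$ that follows from Theorem 3 and differentiates it with respect to the relative angles, keeping the relative velocities fixed; function names and test data are hypothetical, and the pairs that vary with $\beta_l$ are those with $h < l \le k$.

```python
import math


def _kinematics(i, L, LG, beta, beta_dot):
    """Absolute angles and rates of the first i links, and the lengths of (31b)."""
    alpha = [sum(beta[:m + 1]) for m in range(i)]
    alpha_dot = [sum(beta_dot[:m + 1]) for m in range(i)]
    length = list(L[:i - 1]) + [LG]
    return alpha, alpha_dot, length


def link_energy(i, M, LG, rho, L, beta, beta_dot):
    """Kinetic energy of link i (1-based) as a function of the relative coordinates."""
    alpha, alpha_dot, length = _kinematics(i, L, LG, beta, beta_dot)
    t = 0.5 * M * sum((L[h] * alpha_dot[h]) ** 2 for h in range(i - 1))
    t += 0.5 * M * (rho * alpha_dot[i - 1]) ** 2
    for h in range(i):
        for k in range(h + 1, i):
            t += (M * length[h] * length[k] * alpha_dot[h] * alpha_dot[k]
                  * math.cos(alpha[k] - alpha[h]))
    return t


def link_gradient(i, M, LG, L, beta, beta_dot):
    """c_i = dT_i/dbeta with beta_dot held fixed; the first component is zero."""
    alpha, alpha_dot, length = _kinematics(i, L, LG, beta, beta_dot)
    c = [0.0] * i
    for l in range(1, i):                  # beta_1 changes no relative angle
        for h in range(l):                 # pairs with h < l <= k (0-based indices)
            for k in range(l, i):
                c[l] -= (M * length[h] * length[k] * alpha_dot[h] * alpha_dot[k]
                         * math.sin(alpha[k] - alpha[h]))
    return c
```

Comparing `link_gradient` with central differences of `link_energy` for arbitrary data is a quick way to catch sign or index mistakes before the vectors $c_i$ are assembled into c.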

Finally, it is important to note that if the work plane of the robot is vertical, the gravitational
torques can be calculated by using the following result.

Theorem 5. If the work plane of the robot is vertical, the weight of the i-th link originates a
torque on the k-th joint, $k \le i$:

$$\gamma_k = M_i\,g\sum_{m=k}^{i}L_m\cos\alpha_m, \qquad (33)$$

where g is the gravitational acceleration and the lengths $L_m$ are given by (31b).

Proof. The proof easily follows by considering Fig. 6. 

Remark 4. The efficiency of the proposed method has been explicitly compared to that of the
Articulated-Body approach in terms of the number of flops (in the MATLAB™ environment)
required to execute the same elementary integration step by using the absolute Lagrangian
coordinates (see Fig. 9). The figure shows that, for industrial robots with a limited
number of links, the proposed method is more efficient.

Fig. 9. Efficiency comparison between the Articulated Body method and the proposed 
method. 

Remark 5. The above-derived analytical expressions, in explicit form, of B, C and $\gamma$ are
useful for speeding up the simulation, for comparing them to the proposed numerical
approach and, finally, for evaluating which terms can be simplified or neglected in the case of
robots with many and/or flexible links. Thus, for instance, for flexible link robot models
obtained via discretization of the links, the terms relative to the deformation angles can be
simplified by substituting the sine function with its argument, or can even be
completely neglected, considering the terms relative to the motion only. 

4. Spatial robot modeling 

For spatial robots with links of generic shape connected, for the sake of brevity, by
spherical joints, several theorems are formulated and proved in a simple manner,
and some algorithms are provided that allow efficiently computing, analytically or
numerically, both the inertia matrix and the gradient of the potential energy.

4.1 Hypotheses and notations 

In the following, for brevity, only spatial robots constituted by n links of generic shape,
connected by spherical joints, are considered.

Moreover, the following notations will be used (see Fig. 10): 

Fig. 10. Coordinates of the generic point P of a link with respect to the body frame and to 
the inertial frame. 

$q_i = [x_{Oi}^T\ \alpha_i^T]^T = [x_{O1i}\ x_{O2i}\ x_{O3i}\ \alpha_{1i}\ \alpha_{2i}\ \alpha_{3i}]^T$ = "absolute coordinates" of the i-th link, 

$I_i = [I_{hki}]$ = inertia matrix of the i-th link with respect to the axes $\xi_1$, $\xi_2$, $\xi_3$, 

$\xi_{Gi} = [\xi_{G1i}\ \xi_{G2i}\ \xi_{G3i}]^T$ = coordinates of the center of mass $G_i$ in the body frame, 

$\xi_{O'i} = [\xi_{O'1i}\ \xi_{O'2i}\ \xi_{O'3i}]^T$ = coordinates of the rotation center $O'_i$ in the body frame, 

$M_i$ = total mass of the i-th link, 

$\alpha_{1\ldots i} = [\alpha_1^T\ \alpha_2^T\ \ldots\ \alpha_i^T]^T$ = "angular absolute coordinates" of the chain constituted by the first i links, 

$\Re_i(\alpha_i)$ = rotation matrix of the i-th link, 

$R_{hi} = \partial\Re_{hi}/\partial\alpha_i$, $h = 1,2,3$, $\Re_{hi}$ being the h-th column of $\Re_i$, 

$A_i$ = inertia matrix of the i-th link with respect to the coordinates $\alpha_{1\ldots i}$, 

$\alpha = \alpha_{1\ldots n}$ = "absolute angular coordinates" of the robot, 

$A$ = inertia matrix of the robot with respect to the coordinates $\alpha$, 

$I$ = identity matrix of order 3, 

$s_{hi} = \sin(\alpha_{hi})$, 

$c_{hi} = \cos(\alpha_{hi})$. 



4.2 Main results 

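Let us consider a generic link, of generic shape, of a spatial robot with spherical joints.
The following theorems and algorithms are provided for the computation of the kinetic energy
$T_i$ of the i-th link, first as a function of the absolute coordinates $q_i$ of the i-th link and
then as a function of the absolute angular coordinates $\alpha_{1\ldots i}$ of the chain constituted
by the first i links.

Theorem 6. The kinetic energy of the i-th link may be calculated by means of the
relationship

$$T_i = \frac{1}{2}\dot q_i^T\,\Im_i\,\dot q_i, \qquad (34)$$

where

$$\Im_i = \begin{bmatrix} M_iI & M_i\sum\limits_h R_{hi}\xi_{Ghi} \\ M_i\sum\limits_h \xi_{Ghi}R_{hi}^T & \sum\limits_{h,k} R_{hi}^TR_{ki}I_{hki} \end{bmatrix}. \qquad (35)$$

Proof. By omitting subscript i, the coordinates of the generic point P of the link in the fixed
frame (see Fig. 10) are given by

$$x_P = x_O + \Re(\alpha)\,\xi_P, \qquad (36)$$

where $\Re$ is the rotation matrix of the body frame with respect to the fixed frame and $\xi_P$ is
the vector of the coordinates of the generic point P of the link in the body frame.
From (36) it follows

$$\dot x_P = \dot x_O + \sum_h R_h\,\xi_{Ph}\,\dot\alpha, \qquad (37)$$

where

$$R_h = \frac{\partial\Re_h}{\partial\alpha}, \qquad h = 1,2,3, \qquad (38)$$

$\Re_h$ being the h-th column of $\Re$. From (37) it is

$$V_P^2 = \dot x_P^T\dot x_P = \dot x_O^T\dot x_O + \dot\alpha^T\sum_{h,k}R_h^TR_k\,\xi_{Ph}\xi_{Pk}\,\dot\alpha + 2\dot x_O^T\sum_h R_h\,\xi_{Ph}\,\dot\alpha, \qquad (39)$$

from which

$$T = \frac{1}{2}M\dot x_O^T\dot x_O + \frac{1}{2}\dot\alpha^T\sum_{h,k}R_h^TR_kI_{hk}\,\dot\alpha + M\dot x_O^T\sum_h R_h\,\xi_{Gh}\,\dot\alpha = \frac{1}{2}\begin{bmatrix}\dot x_O^T & \dot\alpha^T\end{bmatrix}\begin{bmatrix} MI & M\sum\limits_h R_h\xi_{Gh} \\ M\sum\limits_h \xi_{Gh}R_h^T & \sum\limits_{h,k}R_h^TR_kI_{hk}\end{bmatrix}\begin{bmatrix}\dot x_O \\ \dot\alpha\end{bmatrix}, \qquad (40)$$

and hence the proof.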

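Remark 6. The vector of the coordinates of the center of mass $\xi_G$ and the inertia matrix
$[I_{hk}]$ of the i-th link, in the case of a complex structure, may be easily evaluated by making
use of software packages, such as CATIA™.

Theorem 7. If the Euler angles of the i-th link $\alpha_i = [\alpha_{1i}\ \alpha_{2i}\ \alpha_{3i}]^T$ are the ZYX angles, also
called RPY (Roll-Pitch-Yaw) angles (see Fig. 10), then, by omitting subscript i, the matrices $R_h$
are:

$$R_1 = \begin{bmatrix} -s_1c_2 & -c_1s_2 & 0 \\ c_1c_2 & -s_1s_2 & 0 \\ 0 & -c_2 & 0 \end{bmatrix}, \qquad (41)$$

$$R_2 = \begin{bmatrix} -s_1s_2s_3 - c_1c_3 & c_1c_2s_3 & c_1s_2c_3 + s_1s_3 \\ c_1s_2s_3 - s_1c_3 & s_1c_2s_3 & s_1s_2c_3 - c_1s_3 \\ 0 & -s_2s_3 & c_2c_3 \end{bmatrix}, \qquad (42)$$

$$R_3 = \begin{bmatrix} -s_1s_2c_3 + c_1s_3 & c_1c_2c_3 & -c_1s_2s_3 + s_1c_3 \\ c_1s_2c_3 + s_1s_3 & s_1c_2c_3 & -s_1s_2s_3 - c_1c_3 \\ 0 & -s_2c_3 & -c_2s_3 \end{bmatrix}. \qquad (43)$$

Proof. The rotation matrix $\Re_i(\alpha_i)$ of the i-th link, by omitting subscript i, is

$$\Re(\alpha) = \begin{bmatrix} c_1c_2 & c_1s_2s_3 - s_1c_3 & c_1s_2c_3 + s_1s_3 \\ s_1c_2 & s_1s_2s_3 + c_1c_3 & s_1s_2c_3 - c_1s_3 \\ -s_2 & c_2s_3 & c_2c_3 \end{bmatrix}, \qquad (44)$$

from which, by (38), the proof follows.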
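
A numerical cross-check of these matrices can be sketched in Python. The sketch assumes that each $R_h$ is the Jacobian, with respect to the three angles, of the h-th column of the ZYX rotation matrix; this reading of (38) is an assumption of the example, as are all function names. Only the closed form obtained for the first column is coded explicitly; the other two can be checked in the same way.

```python
import math


def rotation(a):
    """ZYX (roll-pitch-yaw) rotation matrix for the angles a = [a1, a2, a3]."""
    s1, s2, s3 = (math.sin(x) for x in a)
    c1, c2, c3 = (math.cos(x) for x in a)
    return [[c1 * c2, c1 * s2 * s3 - s1 * c3, c1 * s2 * c3 + s1 * s3],
            [s1 * c2, s1 * s2 * s3 + c1 * c3, s1 * s2 * c3 - c1 * s3],
            [-s2, c2 * s3, c2 * c3]]


def column_jacobian(a, h, delta=1e-6):
    """Central-difference Jacobian of column h (0-based) of the rotation matrix."""
    jac = [[0.0] * 3 for _ in range(3)]
    for m in range(3):
        ap, am = a[:], a[:]
        ap[m] += delta
        am[m] -= delta
        rp, rm = rotation(ap), rotation(am)
        for r in range(3):
            jac[r][m] = (rp[r][h] - rm[r][h]) / (2 * delta)
    return jac


def r1_closed_form(a):
    """Closed-form Jacobian of the first column [c1 c2, s1 c2, -s2]^T."""
    s1, s2 = math.sin(a[0]), math.sin(a[1])
    c1, c2 = math.cos(a[0]), math.cos(a[1])
    return [[-s1 * c2, -c1 * s2, 0.0],
            [c1 * c2, -s1 * s2, 0.0],
            [0.0, -c2, 0.0]]
```

For any set of angles, `column_jacobian(a, 0)` should agree with `r1_closed_form(a)` up to the finite-difference error.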

The computation of the matrices $\Im_i$ may be sped up by the following algorithm, where
subscript i has been omitted.

Algorithm 1.

Step 1. Compute

$$s_1,\ s_2,\ s_3;\quad c_1,\ c_2,\ c_3. \qquad (45)$$

Step 2. Compute

$$s_1s_2,\ s_1s_3,\ s_2s_3;\quad c_1c_2,\ c_1c_3,\ c_2c_3;$$
$$s_1c_2,\ s_1c_3,\ s_2c_3;\quad c_1s_2,\ c_1s_3,\ c_2s_3;$$
$$(s_1s_2)s_3,\ (s_1s_2)c_3;\quad (s_1c_2)s_3,\ (s_1c_2)c_3;$$
$$(c_1c_2)s_3,\ (c_1c_2)c_3;\quad (c_1s_2)s_3,\ (c_1s_2)c_3. \qquad (46)$$

Step 3. Compute

$$R_1^TR_1,\ R_1^TR_2,\ R_1^TR_3;\quad R_2^TR_2,\ R_2^TR_3;\quad R_3^TR_3. \qquad (47)$$

If the axis $\xi_1$ of the body frame is chosen so as to contain the center of gravity $G_i$ and is a
principal axis of inertia, then, by choosing the other two axes parallel to the other principal
axes of inertia, the computation of the matrix $\Im_i$ may be further sped up, as shown by
the following theorem. 

Theorem 8. If the body reference axes of the i-th link are parallel to the principal axes of inertia
and the center of gravity $G_i$ belongs to the axis $\xi_{1i}$, the matrix $\Im_i$ is

$$\Im_i = \begin{bmatrix} M_iI & M_iR_{1i}\,\xi_{G1i} \\ M_i\,\xi_{G1i}R_{1i}^T & \sum\limits_h R_{hi}^TR_{hi}I_{hhi} \end{bmatrix}. \qquad (48)$$

Proof. The proof easily follows from (35) and from

$$\xi_{Ghi} = 0,\ h = 2,3; \qquad I_{hki} = 0,\ h = 1,2,\ k = h+1,\ldots,3. \qquad (49)$$


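Let us now consider a robot with n links (see Fig. 11). The following theorems and
algorithms hold for the computation of the inertia matrix A of the whole robot as a function
of the absolute angular coordinates $\alpha$ of the robot.

Fig. 11. Schematic representation of a spatial robot with n links. 

Theorem 9. The kinetic energy of the i-th link of a generic spatial robot may be computed by
means of the relationship

$$T_i = \frac{1}{2}\dot\alpha_{1\ldots i}^T A_i\,\dot\alpha_{1\ldots i}, \qquad (50)$$

where

$$A_i = \begin{cases} \Im_{221}, & \text{if } i = 1 \\[4pt] \begin{bmatrix} M_iV_{1\ldots i-1}^TV_{1\ldots i-1} & V_{1\ldots i-1}^T\Im_{12i} \\ \Im_{12i}^TV_{1\ldots i-1} & \Im_{22i} \end{bmatrix}, & \text{if } i = 2,3,\ldots,n, \end{cases} \qquad (51)$$

in which:

$$\Im_{12i} = M_i\sum_h R_{hi}\,\xi_{Ghi}, \qquad \Im_{22i} = \sum_{h,k}R_{hi}^TR_{ki}I_{hki}, \qquad (52)$$

and the matrix $V_{1\ldots i-1}$ is defined in (54).

Proof. From the following relationships (see Fig. 11 and (37))

$$\dot x_{Oi} = \dot x_{O'i-1} = \dot x_{Oi-1} + \sum_h R_{hi-1}\,\xi_{O'hi-1}\,\dot\alpha_{i-1},\quad i = 2,\ldots,n, \qquad \dot x_{O1} = 0, \qquad (53)$$

it is

$$\dot x_{Oi} = [V_1\ V_2\ \ldots\ V_{i-1}]\,\dot\alpha_{1\ldots i-1} = V_{1\ldots i-1}\,\dot\alpha_{1\ldots i-1}, \qquad V_j = \sum_h R_{hj}\,\xi_{O'hj}, \qquad (54)$$

from which

$$\dot q_i = \begin{bmatrix} V_{1\ldots i-1} & 0 \\ 0 & I \end{bmatrix}\dot\alpha_{1\ldots i}. \qquad (55)$$

By substituting (55) in (34), it is

$$T_i = \frac{1}{2}\dot\alpha_{1\ldots i}^T\begin{bmatrix} V_{1\ldots i-1} & 0 \\ 0 & I \end{bmatrix}^T\Im_i\begin{bmatrix} V_{1\ldots i-1} & 0 \\ 0 & I \end{bmatrix}\dot\alpha_{1\ldots i} = \frac{1}{2}\dot\alpha_{1\ldots i}^T\begin{bmatrix} M_iV_{1\ldots i-1}^TV_{1\ldots i-1} & V_{1\ldots i-1}^T\Im_{12i} \\ \Im_{12i}^TV_{1\ldots i-1} & \Im_{22i} \end{bmatrix}\dot\alpha_{1\ldots i}, \qquad (56)$$

from which the proof follows.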

The next theorem, similar to Theorem 8, is useful to simplify the computation of the
matrices $A_i$.

Theorem 10. If the axes of the body frame of link j, j = 1,2,...,i-1, are parallel to the
principal axes of inertia and the centers of gravity $G_j$ and the rotation centers $O'_j$ belong to
the axis $\xi_{1j}$, matrices (52) may be more easily calculated by the relations:

$$\Im_{12i} = M_iR_{1i}\,\xi_{G1i}, \qquad \Im_{22i} = \sum_h R_{hi}^TR_{hi}I_{hhi}, \qquad (57)$$

$$V_{1\ldots i-1} = [V_1\ V_2\ \ldots\ V_{i-1}], \qquad V_j = R_{1j}\,\xi_{O'1j}, \quad j = 1,2,\ldots,i-1. \qquad (58)$$

Proof. The proof easily follows from Theorem 8 and from the fact that

$$\xi_{O'hj} = 0, \qquad h = 2,3,\quad j = 1,2,\ldots,i-1. \qquad (59)$$


The following theorems may be effectively used to compute the gradient of the gravitational 
potential energy. 

Theorem 11. The gradient of the gravitational potential energy of the i-th link $U_i$ with
respect to the angular coordinates $\alpha_{1\ldots i}$ may be calculated by the formula

$$\gamma_i = \frac{\partial U_i}{\partial \alpha_{1\ldots i}} = -M_i\,[\,R_{11}\xi_{O'_1}\;\; R_{21}\xi_{O'_1}\;\; R_{31}\xi_{O'_1}\;\; R_{12}\xi_{O'_2}\;\; R_{22}\xi_{O'_2}\;\; R_{32}\xi_{O'_2}\;\; \cdots\;\; R_{1i}\xi_{G_i}\;\; R_{2i}\xi_{G_i}\;\; R_{3i}\xi_{G_i}\,]^T g, \tag{60}$$

where g is the gravity acceleration vector in the inertial frame.

Proof. It is easy to verify that the gravitational potential energy of the generic i-th link
results

$$U_i = -M_i\, g^T \left( R_1(\alpha_1)\,\xi_{O'_1} + R_2(\alpha_2)\,\xi_{O'_2} + \cdots + R_i(\alpha_i)\,\xi_{G_i} \right). \tag{61}$$

Relation (60) easily follows from (61).
The next theorem is useful to simplify the computation of $\partial U_i / \partial \alpha_{1\ldots i}$.

Theorem 12. If the $x_3$-axis of the inertial frame is vertical and upwards oriented, the
rotation centers $O'_j$ of the j-th links, j = 1,2,...,i-1, belong to the axes $\xi_{1j}$ of the body
frames, the center of gravity $G_i$ of the i-th link belongs to the axis $\xi_{1i}$ and the Euler angles
of the j-th links, j = 1,2,...,i, are the ZYX angles, the gradient $\partial U_i / \partial \alpha_{1\ldots i}$ may be calculated
much more easily by the relationship

$$\gamma_i = \frac{\partial U_i}{\partial \alpha_{1\ldots i}} = M_i\, g\,[\,0\;\; c_{21}\xi_{O'_1 1}\;\; c_{22}\xi_{O'_2 1}\;\; \cdots\;\; c_{2i}\xi_{G_i 1}\;\; 0\,]^T. \tag{62}$$

Proof. Since the $x_3$-axis is vertical and upwards oriented it is $g = [\,0\;\; 0\;\; -g\,]^T$, where g is
the gravity acceleration. Moreover, since $O'_j \in \xi_{1j}$, j = 1,2,...,i-1, and $G_i \in \xi_{1i}$:

$$\xi_{O'_j} = [\,\xi_{O'_j 1}\;\; 0\;\; 0\,]^T, \qquad \xi_{G_i} = [\,\xi_{G_i 1}\;\; 0\;\; 0\,]^T, \quad i = 1,\ldots,n. \tag{63}$$

The proof easily follows from (41)-(43), (60), (63).

4.3 Application of the main results 

On the basis of Theorem 9, the kinetic energy of the whole robot is

$$T = T_1 + T_2 + \cdots + T_n = \frac{1}{2}\,\dot{\alpha}^T A\,\dot{\alpha}, \tag{64}$$

where the inertia matrix $A = A(\alpha)$ is obtained by assembling the matrices $A_i$ according to the
following efficient algorithm written in Matlab-like language:

A=An;                                  % start from the 3n-by-3n matrix of link n
for i=n-1:-1:1
    A(1:3*i,1:3*i)=A(1:3*i,1:3*i)+Ai;  % add the 3i-by-3i matrix of link i to the leading block
end

Once the vectors $\gamma_i$, i = 1,2,...,n, have been computed by (60) or (62), the gradient of
the gravity energy $\gamma$ is obtained by assembling the $\gamma_i$ according to the following efficient
algorithm written in Matlab-like language:

gamma=gamman;                          % start from the 3n-by-1 vector of link n
for i=n-1:-1:1
    gamma(1:3*i)=gamma(1:3*i)+gammai;  % add the 3i-by-1 vector of link i to the leading entries
end
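
The two assembly loops can also be sketched in Python, under the assumption that the per-link matrices $A_i$ (size 3i x 3i) and gradient vectors $\gamma_i$ (length 3i) have already been computed and are passed in as plain lists. The function names `assemble_inertia` and `assemble_gradient` are hypothetical and chosen only for this illustration; they are not part of the original method's code.

```python
def assemble_inertia(link_matrices):
    """Assemble the robot inertia matrix A from the per-link matrices.

    link_matrices[i - 1] is A_i, a square list of lists of size 3i x 3i.
    """
    n = len(link_matrices)
    size = 3 * n
    A = [[0.0] * size for _ in range(size)]
    for i, A_i in enumerate(link_matrices, start=1):
        block = 3 * i
        if len(A_i) != block or any(len(row) != block for row in A_i):
            raise ValueError(f"A_{i} must be {block} x {block}")
        # A_i only involves the coordinates of links 1..i,
        # so it is added to the leading 3i x 3i block of A.
        for r in range(block):
            for c in range(block):
                A[r][c] += A_i[r][c]
    return A


def assemble_gradient(link_gradients):
    """Assemble the gravity gradient from the per-link vectors (length 3i each)."""
    n = len(link_gradients)
    gamma = [0.0] * (3 * n)
    for i, gamma_i in enumerate(link_gradients, start=1):
        if len(gamma_i) != 3 * i:
            raise ValueError(f"gamma_{i} must have length {3 * i}")
        for r in range(3 * i):
            gamma[r] += gamma_i[r]
    return gamma
```

Summing in increasing or decreasing order of i gives the same result; the decreasing order of the Matlab-like version simply lets the largest matrix serve as the initial value.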

Finally, by using the efficient integration scheme shown in Fig. 1 (Celentano & Iervolino, 
2006), it is possible to get the dynamic model of the robot in a simple way. 

4.4 Comparison in terms of efficiency with the articulated-body method 

The efficiency of the modeling and simulation method proposed for spatial robots has
been explicitly compared to the efficiency of a Newtonian method of order N considered
one of the most efficient methods in the literature, i.e. the Articulated-Body Method
(Featherstone, 1987), (Featherstone & Orin, 2000).

In more detail, this comparison has been carried out in terms of the number of flops (in the
MATLAB™ environment) required for the execution of the same integration step (see Table 1).



Number of links | Flops of the proposed method | Flops of the Articulated-Body method
1               | 362                          | 1269
2               | 2459                         | 7168
3               | 8400                         | 13758
4               | 21451                        | 20804
5               | 45868                        | 28252


Table 1. Efficiency comparison between the proposed method and the Articulated-Body 
method. 

From Table 1 it follows that, for practical spatial robots (which of course have a limited
number of links), the proposed method is more efficient when n < 4, even though spatial robots
with spherical joints with three degrees of freedom have been considered. Moreover, it is
important to note that the much greater simplicity of the proposed method with respect to the
Articulated-Body one is itself an important index of its efficiency.
However, the authors plan to continue and to improve this comparison.



5. Elements of flexible robots modeling 

5.1 Methodology 

The results stated in the previous sections allow obtaining, quite simply, finite-dimensional
models that are accurate and computationally efficient.

These models, for rectilinear links, can be obtained by approximating the i-th link as follows:

- in order to compute the kinetic energy and, therefore, the inertia matrix and the
  gradient of the kinetic energy, with $\nu_i + 1$ rigid sublinks with lengths $L_{1i},\ldots,L_{\nu_i+1,i}$,
  interconnected by relative angles $\beta_{1i},\ldots,\beta_{\nu_i i}$ (see Fig. 12);
- in order to compute the elastic energy, with $\nu_i$ flexible sublinks joined and deformable
  such that their axes make a suitable spline tangent to the axes of the intermediate rigid
  sublinks in the extreme points of the link and to the intermediate rigid sublinks in
  suitable interior points (see Fig. 12).




Fig. 12. Approximation of a flexible link with $\nu_i + 1 = 5$ rigid sublinks and with $\nu_i = 4$
flexible sublinks congruent to each other at the end.

In this way, neglecting the deformation angles with respect to the angles of motion in the
computation of the kinetic energy, it is easy to verify, from the results of Sections 2 and 3, that
the inertia matrix can be computed as follows:

$$B = B(\beta_2,\ldots,\beta_n) = B_1 + B_2\cos\beta_2 + \cdots + B_n\cos\beta_n + B_{23}\cos(\beta_2+\beta_3) + B_{24}\cos(\beta_2+\beta_3+\beta_4) + \cdots + B_{n-1,n}\cos(\beta_{n-1}+\beta_n), \tag{65}$$

where $\beta_i$ is the angle of motion between the (i-1)-th link and the i-th link, i = 2,...,n (see
Fig. 13).

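From (65) it is also easy to compute the gradient of the kinetic energy with respect to
$\beta = [\,\beta_1\;\; \beta_{11} \cdots \beta_{\nu_1 1}\;\;\; \beta_2\;\; \beta_{12} \cdots \beta_{\nu_2 2}\;\;\; \cdots\;\;\; \beta_n\;\; \beta_{1n} \cdots \beta_{\nu_n n}\,]^T$ using the following relation

$$\frac{\partial T}{\partial \beta} = \frac{1}{2}\left[\,\dot{\beta}^T \frac{\partial B}{\partial \beta_1}\dot{\beta}\;\;\; \cdots\;\;\; \dot{\beta}^T \frac{\partial B}{\partial \beta_{\nu_n n}}\dot{\beta}\,\right]^T. \tag{66}$$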
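
As an illustration of how a cosine expansion of the inertia matrix makes the gradient of the kinetic energy cheap to evaluate, the following Python sketch uses a two-coordinate toy case $B(\beta_2) = B_1 + B_2\cos\beta_2$. The numerical entries of `B1` and `B2` are hypothetical values chosen only for the example; the analytic gradient $\frac{1}{2}\dot{\beta}^T (\partial B/\partial\beta_2)\dot{\beta}$ can be checked against a finite difference of the kinetic energy.

```python
import math

# Hypothetical constant matrices of a two-coordinate example (not from the chapter).
B1 = [[3.0, 1.0], [1.0, 1.0]]   # constant term of the inertia matrix
B2 = [[2.0, 1.0], [1.0, 0.0]]   # coefficient of cos(beta_2)


def inertia(beta2):
    """Inertia matrix B(beta_2) = B1 + B2*cos(beta_2)."""
    c = math.cos(beta2)
    return [[B1[r][k] + B2[r][k] * c for k in range(2)] for r in range(2)]


def quad_form(M, v):
    """Quadratic form v^T M v for a 2 x 2 matrix."""
    return sum(v[r] * M[r][k] * v[k] for r in range(2) for k in range(2))


def kinetic_energy(beta2, beta_dot):
    """T = 1/2 * beta_dot^T B(beta_2) beta_dot."""
    return 0.5 * quad_form(inertia(beta2), beta_dot)


def kinetic_gradient(beta2, beta_dot):
    """dT/dbeta_2 = 1/2 * beta_dot^T (dB/dbeta_2) beta_dot, with dB/dbeta_2 = -B2*sin(beta_2)."""
    s = -math.sin(beta2)
    dB = [[B2[r][k] * s for k in range(2)] for r in range(2)]
    return 0.5 * quad_form(dB, beta_dot)
```

Because only the sine of each angle combination has to be evaluated, the gradient costs about as much as the inertia matrix itself.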



There are different ways of approximating a flexible link using a chain of rigid sublinks and 
flexible sublinks congruent to each other at the end (Celentano, 2007). 

Fig. 13. Angle of motion between two flexible consecutive links. 

In the hypothesis that the stiffness of the i-th link is uniform, a very efficient way for the
choice of the sublinks is to consider the intermediate rigid sublinks with a length of $L_i/\nu_i$,
the extreme ones with half that length, and the flexible sublinks with a length of $L_i/\nu_i$ and
deformable as parabolic arcs. Consequently the elastic energy of the i-th link results

$$U_i = \frac{1}{2} K_i \left( \beta_{1i}^2 + \cdots + \beta_{\nu_i i}^2 \right), \tag{67}$$

where

$$K_i = \frac{\nu_i E_i I_i}{L_i} \tag{68}$$

is the stiffness of the flexible sublinks.
Therefore the vector of the elastic torques is

$$\gamma = [\,0\;\; K_1\beta_{11}\;\; K_1\beta_{21} \cdots K_1\beta_{\nu_1 1}\;\;\; 0\;\; K_2\beta_{12}\;\; K_2\beta_{22} \cdots K_2\beta_{\nu_2 2}\;\;\; \cdots\;\;\; 0\;\; K_n\beta_{1n}\;\; K_n\beta_{2n} \cdots K_n\beta_{\nu_n n}\,]^T. \tag{69}$$

Remark 7. The proposed method is simple to apply, is very efficient from a computational
point of view and provides a model with null error when the control and disturbance
torques are constant. For control actions and disturbances, including forces, with a fixed and
limited bandwidth, this model provides an error that decreases strongly as the number of
sublinks increases. This fact clearly emerges from the following example.

5.2 Example: Modeling of a Single-Link Flexible Arm 

Consider a flexible robot with a single flexible link (see Fig. 14). 

If the link is a steel bar with a length of L = 1 m and with a square section of side
l = 0.01 m = 1 cm, the first five frequencies computed with the theoretical formula

$$f_i = \frac{\pi}{32 L^2}\sqrt{\frac{EI}{m}}\,(4i+1)^2, \quad i = 1,2,\ldots,5, \tag{70}$$

where E is Young's modulus of elasticity ($\cong 10^{11}$ N/m$^2$), $I = l^4/12$ is the moment of
inertia of the section with respect to the deflection axis and $m = 7876\,l^2$ kg/m is the mass per unit
length of the link, are:

$$f = 6.31,\; 20.4,\; 42.7,\; 73.0,\; 111.3\ \text{Hz}. \tag{71}$$




Fig. 14. Schematic representation of a single-link flexible arm. 

The frequencies obtained with the proposed method, dividing the link into five and ten
flexible sublinks, are respectively:

$$f = 6.31,\; 20.4,\; 41.9,\; 65.6,\; 90.8\ \text{Hz}, \tag{72}$$

$$f = 6.31,\; 20.4,\; 42.6,\; 72.77,\; 110.5\ \text{Hz}. \tag{73}$$



Stressing the link with $C_c = -1$ Nm, $C_d = 1$ Nm, $F_d = 0$ N, if $\beta = 0$, the theoretical values of
$\alpha$ and $\delta$ are:

$$\alpha = \frac{L}{EI}\,C_d = 1.375^\circ, \qquad \delta = \frac{L^2}{2EI}\,C_d = 2.40\ \text{cm}. \tag{74}$$

It is worth noting that these values and the ones obtained using the proposed method are
coincident $\forall \nu \ge 1$ (Celentano, 2007).

Applying to the link $C_c = -2$ Nm, $C_d = 0$ Nm, $F_d = 1$ N, if $\beta = 0$, the theoretical values of
$\alpha$ and $\delta$ result:

$$\alpha = \frac{L^2}{2EI}\,F_d = 1.375^\circ, \qquad \delta = \frac{L^3}{3EI}\,F_d = 3.20\ \text{cm}. \tag{75}$$

The first value (the orientation angle of the payload due to the arm deflection) and the ones
computed with the proposed method $\forall \nu \ge 1$ are coincident, while the second value (the
displacement of the payload due to the arm deflection) obtained using the proposed method has a
relative error of (Celentano, 2007)

$$-\frac{1}{4\nu^2} = \begin{cases} -0.01 = -1\%, & \text{if } \nu = 5 \\ -0.0025 = -0.25\%, & \text{if } \nu = 10. \end{cases} \tag{76}$$
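
The static results above can be checked with a short Python sketch of the lumped model. It assumes small deflections, a link clamped at its base, $\nu$ elastic joints placed at the midpoints of $\nu$ equal segments (which corresponds to half-length rigid sublinks at the two ends), and a joint stiffness $K = \nu EI/L$, i.e. the bending stiffness of a segment of length $L/\nu$ with uniform curvature. The function name `lumped_tip_response` and the normalized values used below are hypothetical.

```python
def lumped_tip_response(nu, L, EI, force=0.0, moment=0.0):
    """Tip rotation and tip deflection of a clamped link modeled with nu elastic joints.

    Assumptions (illustrative): small angles, joints at x_k = (k - 1/2) * L / nu,
    each with stiffness K = nu * EI / L; `force` and `moment` act at the tip.
    """
    K = nu * EI / L
    alpha = 0.0   # rotation of the tip
    delta = 0.0   # transverse displacement of the tip
    for k in range(1, nu + 1):
        x_k = (k - 0.5) * L / nu          # joint position along the undeformed link
        arm = L - x_k                     # distance from the joint to the tip
        beta_k = (moment + force * arm) / K
        alpha += beta_k
        delta += beta_k * arm
    return alpha, delta
```

With a tip force, the rotation equals $L^2 F/(2EI)$ for every $\nu$, while the deflection equals $\frac{L^3 F}{3EI}\left(1 - \frac{1}{4\nu^2}\right)$, which reproduces the relative errors of 1% for $\nu = 5$ and 0.25% for $\nu = 10$; with a tip moment both values are exact.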



6. Conclusion 

In this chapter an innovative method for robot modeling and simulation, based on an
appropriate mathematical formulation of the relative equations of motion and on a new
integration scheme, has been illustrated. The proposed approach requires the
calculation of the inertia matrix and of the gradient of the kinetic energy only. It provides a
new analytical-numerical methodology that has been shown to be simpler and numerically
more efficient than the classical approaches, requires no a priori specialized knowledge of
the dynamics of mechanical systems and is formulated so as to allow students,
researchers and professionals to easily employ it for the analysis of manipulators with the
complex-shaped links commonly used in industry.

In the case of planar robots with revolute joints, theorems have been stated and proved that 
offer a particularly simple and efficient method of computation for both the inertia matrix 
and the gradient of the kinetic energy. Then a comparison has been made in terms of 
efficiency between the proposed method and the Articulated-Body one. 

Moreover, for spatial robots with links of generic shape connected, for the sake of brevity,
by spherical joints, several theorems have been formulated and proved in a simple
manner, and some algorithms have been provided that allow efficient computation of the inertia
matrix analytically, and of the gradients of the kinetic and gravitational energy analytically
or numerically. Furthermore, also in this case a comparison of the proposed method in
terms of efficiency with the Articulated-Body one has been reported.

Finally, a methodology for flexible robot modeling that allows obtaining, quite simply,
accurate and computationally efficient finite-dimensional models has
been provided. This method has been illustrated with a significant example.
1. Introduction



Nowadays, the computational power of the average personal computer gives a vast
audience the possibility of simulating complex models of reality within reasonable time
frames. This chapter presents a review of modelling techniques for underwater vehicles
with fixed geometry, with emphasis on their application to real-time or faster than
real-time simulation.

In the last decade there was a strong movement towards the development of Autonomous 
Underwater Vehicles (AUV) and Remotely Operated Vehicles (ROV). These two classes of 
underwater vehicles are intended to provide researchers with simple, long-range, low-cost, 
rapid response capability to collect pertinent environmental data. There are numerous 
applications for AUV and ROV, including underwater structure inspection, oceanographic 
surveys, operations in hazardous environments, and military applications. In order to fulfil 
these objectives, the vehicles must be provided with a set of controllers assuring the desired 
type of autonomous operation and offering some aid to the operator, for vehicles which can 
be teleoperated. 

The design and tuning of controllers requires, in most methodologies, a mathematical
model of the system to be controlled. Control of underwater vehicles is no exception to this
rule. The most common model in control theory is the classic system of differential
equations, where x and u denote respectively the state vector and the input vector:

$$\dot{x} = f(x,u) \tag{1}$$

In this framework, the most realistic models of underwater vehicles require f(x,u) to be a
nonlinear function. However, as we will see, under certain assumptions, the linearization of
f(x,u) may still result in an acceptable model of the system, with the added advantage of
analytic simplicity.

On the other hand, simulation, or more specifically, numerical simulation, does not require
a model with the conceptual simplicity of Eq. (1). In this case, we are not concerned with an
analytic proof of the system's properties. The main objective is to compute the evolution of a
set of state variables, given the system's inputs.
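
Computing the evolution of the state from the inputs can be sketched in a few lines of Python. The example below is only an illustration under stated assumptions: it uses a classical fourth-order Runge-Kutta step with the input held constant over each step, and a toy first-order model `first_order_lag` that stands in for the vehicle dynamics f(x,u); all function names are hypothetical.

```python
import math


def rk4_step(f, x, u, dt):
    """One classical Runge-Kutta step for x_dot = f(x, u), with u held constant over dt."""
    k1 = f(x, u)
    k2 = f([xi + 0.5 * dt * ki for xi, ki in zip(x, k1)], u)
    k3 = f([xi + 0.5 * dt * ki for xi, ki in zip(x, k2)], u)
    k4 = f([xi + dt * ki for xi, ki in zip(x, k3)], u)
    return [xi + dt * (a + 2.0 * b + 2.0 * c + d) / 6.0
            for xi, a, b, c, d in zip(x, k1, k2, k3, k4)]


def simulate(f, x0, inputs, dt):
    """Return the list of states obtained by applying the input sequence from x0."""
    x = list(x0)
    trajectory = [x]
    for u in inputs:
        x = rk4_step(f, x, u, dt)
        trajectory.append(x)
    return trajectory


def first_order_lag(x, u):
    """Toy stand-in model: x_dot = -x + u (not a vehicle model)."""
    return [-x[0] + u[0]]


def exact_lag_response(t, x0):
    """Closed-form response of the toy model with zero input, used as a reference."""
    return x0 * math.exp(-t)
```

For example, `simulate(first_order_lag, [1.0], [[0.0]] * 100, 0.01)` integrates the toy model over one second, and its final state can be compared with `exact_lag_response(1.0, 1.0)`.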

The hydrodynamic effects of underwater motion of a rigid body are well described by the
Navier-Stokes equations. However, these equations form a system of nonlinear partial
differential equations whose solution is very hard to compute for general problems.
Computational fluid dynamics deals with the subject of solving these equations. However,
even with today's computer technology and software packages, the time to obtain the
solution on an average computer, even for simple scenarios, is still far from real-time. This kind
of simulation may return very accurate results, but the computation times are not acceptable
if simulation of long-term operation is desired. These models are also too complex for control
design, so approximations are used for that purpose.

If near, or faster than, real-time simulations are desired, acceptable results can be obtained 
by simulating the model employed in the control design. Real-time response is most useful 
when teleoperation simulation is desired. In this case, as in car or flight simulators, the user 
interacts with the underwater vehicle simulator by setting references or direct actuator 
commands on a graphical user interface. 

Usually we are interested in checking the performance of the vehicle's controllers in a set of 
operation scenarios. This is done because the control design may involve certain 
simplifications or heuristic methods which make it difficult to analytically characterize 
certain parameters of the system's response, such as settling time or peak values 
(overshoots) during transient phases of operation. Simulation is an important tool for 
controller tuning and for exposing certain limit situations (e.g., actuator saturation) that may 
be hard to describe analytically on the model employed for control design. 

The typical design cycle involves the test of different control laws and navigation schemes.
In most cases, the control system must be replicated in a simulation environment, usually in
a different language. Even when that is done correctly, it is difficult to keep consistency
between that implementation and the final control system, which may be subject to updates
from other sources. It is possible, as described in (Silva et al., 2007), to have a single
implementation of the control software function unmodified in both real-life and
simulated environments. Instead of writing separate code first for a prototyping
environment and then for the final version, this approach allows the employment of the
stable/final software in the overall design cycle. Therefore, the simulation may also be seen
as a debugging tool of the overall software design process. Underwater vehicle mission
management, with special regard to autonomous operation, may involve complex logic
besides the continuous control laws. It is of major importance to test the implementation
of that logic, namely switching between manoeuvres, manoeuvre coordination, event
detection, etc. Since real-life missions may last for some hours, it is quite useful to simulate
these missions in compressed simulated time.

The literature from naval architecture proposes several models for underwater vehicles
following the structure of Eq. (1). The main difference between these models is how
the hydrodynamic phenomena associated with underwater rigid body motion are modelled.
The model described in (Healey & Lienard, 1993) is used in many works. These authors
refine a model that can be traced back to 1967 (Gertler & Hagen, 1967) in order to describe a
box-shaped AUV. Most recent works use the framework presented in (Fossen, 1994) which,
while less descriptive than the one of (Healey & Lienard, 1993), is more amenable to direct
application of tools from nonlinear control. Recent research on the motion equations of
underwater vehicles can still be found, for instance in (Nahon, 2006).

However, in general, the literature only provides the general equations of the models. These
models are parametrized by tens (sometimes over one hundred) of coefficients. Some of
these coefficients can be easily computed based on direct physical measurements (mass,
length, etc.). However, the computation of the coefficients related to hydrodynamic effects is
not a straightforward task. When considering new designs, the accurate estimation of some
of the coefficients, mainly those associated with hydrodynamic phenomena, usually
requires hydrodynamic tests. Although ingenious techniques can be used, see for instance
(von Ellenrieder, 2006), these tests are usually expensive or involve an apparatus which is
not justifiable for every institution.

Certain software packages can be used to obtain more accurate parameters. For instance, 
(Irwin & Chavet, 2007) present a study comparing results obtained with Computational 
Fluid Dynamics with those of classical heuristic formulas. However, software packages for 
this purpose are usually expensive, or of limited accessibility. 

The simplest alternative approaches rely on empirical formulas, or on adapting the
coefficients from well-proven models of similar vehicles. In fact, empirical results show
that, for vehicles of the same shape, the hydrodynamic effects can be normalized as a
function of scale and of the vehicle's operating speed. For instance, the work of (Healey & Lienard,
1993) presents the complete set of numeric parameters for the model proposed by the
authors in that work. However, the latter method is suitable only for vehicles with a shape
similar to those whose models are known, and the former requires the vehicle being modelled
to follow closely the assumptions of the empirical formulas.

In what follows, we review a standard nonlinear model derived from (Fossen, 1994),
describe further aspects of the hydrodynamic phenomena, and explain how the symmetries
of the vehicle can be exploited in order to reduce the number of considered coefficients.

The final conclusions are drawn based on our experiments with the Light Autonomous
Underwater Vehicle (LAUV) designed and built at University of Porto (see Fig. 1). LAUV is
a torpedo-shaped vehicle, with a length of 1.1 meters, a diameter of 15 cm and a mass of
approximately 18 kg. The actuator system is composed of one propeller and 3 or 4 control
fins (depending on the vehicle version), all electrically driven. We compare trajectories
logged during the operation of the LAUV with trajectories obtained by simulation of the
vehicle's mathematical model.




Fig. 1. Light Autonomous Underwater Vehicle (LAUV) designed and built at University of 
Porto. 

2. Underwater vehicle dynamics 

When discussing underwater vehicle dynamics we typically consider two coordinate
frames: the Earth-Fixed Frame and the Body-Fixed Frame.

The Earth-Fixed Frame defines a coordinate system with origin fixed to an arbitrary point
on the surface of the Earth and following the north-east-down convention: x points due
North, y points due East, and z points toward the center of the Earth. For marine
applications, this frame is considered the inertial frame.

In the Body-Fixed Reference Frame the origin and axes of the coordinate system are fixed
with respect to the (nominal) geometry of the vehicle. The orientation of the axes is as
shown in Fig. 2: if the underwater vehicle has a plane of symmetry (and we will assume
here that they all do) then $x_B$ and $z_B$ lie in that plane of symmetry, $x_B$ is chosen to point
forward and $z_B$ is chosen to point downward. Usually the body axes coincide with the
principal axes of inertia of the vehicle. Fig. 2 shows one possibility. The origin of the
body-fixed frame is frequently chosen to coincide with the center of gravity. This is a natural
choice given the equations of rigid body motion. However, in many situations, most
notably during prototyping, the center of mass may change relative to the
vehicle's geometry. That makes it necessary to recalculate the moments due to some of the forces
involved in the vehicle's motion (e.g., forces due to the actuators). Therefore, in those cases, a
more useful choice would be a point fixed relative to the vehicle's shape, such as the center of
pressure (described later) or simply the geometrical center.




Fig. 2. Body-Fixed Reference Frame. 

The minimum set of variables that completely describes the vehicle's position, orientation,
and linear and angular velocities is called the vehicle state. The most commonly chosen state
variables are the inertial position and orientation of the vehicle, and its body-fixed linear
and angular velocities. The orientation of the vehicle with respect to inertial space can be
described by the Euler angles. These angles are termed yaw angle ($\psi$), pitch angle ($\theta$) and
roll angle ($\phi$). This implies that three different rotations are needed (one for each axis). The
order in which these rotations are carried out is not arbitrary. The standard coordinate
systems and rotations (Lewis, 1989) are as defined in Fig. 3. In what follows, the notation
from the Society of Naval Architects and Marine Engineers (SNAME) is used (Lewis, 1989).
The motions in the body-fixed frame are described by 6 velocity components u, v, w, p, q
and r. Let us define the following vectors:

$$\nu_1 = [\,u\;\; v\;\; w\,]^T \tag{1}$$

$$\nu_2 = [\,p\;\; q\;\; r\,]^T \tag{2}$$

$$\nu = [\,\nu_1^T\;\; \nu_2^T\,]^T \tag{3}$$

The body-fixed linear velocities u, v and w are termed, respectively, surge, sway and heave.
We adopt the following convention: when considering slowly varying ocean currents, these
velocities are relative to a coordinate frame moving with the ocean current.
The vehicle's position and orientation in the inertial frame are defined by the following
vectors:

$$\eta_1 = [\,x\;\; y\;\; z\,]^T \tag{4}$$

$$\eta_2 = [\,\phi\;\; \theta\;\; \psi\,]^T \tag{5}$$

$$\eta = [\,\eta_1^T\;\; \eta_2^T\,]^T \tag{6}$$


Fig. 3. Euler angles definition: yaw, pitch and roll. 

The twelve basic states of an underwater vehicle may therefore be written as (again, this is
just one possible choice, but it is the standard one):

Name | Description                                        | Unit
u    | Linear velocity along body-fixed x-axis (surge)    | m/s
v    | Linear velocity along body-fixed y-axis (sway)     | m/s
w    | Linear velocity along body-fixed z-axis (heave)    | m/s
p    | Angular velocity about body-fixed x-axis           | rad/s
q    | Angular velocity about body-fixed y-axis           | rad/s
r    | Angular velocity about body-fixed z-axis           | rad/s
ψ    | Heading angle with respect to the reference axes   | rad
θ    | Pitch angle with respect to the reference axes     | rad
φ    | Roll angle with respect to the reference axes      | rad
x    | Position with respect to the reference axes (North) | m
y    | Position with respect to the reference axes (East)  | m
z    | Position with respect to the reference axes (Down)  | m



Table 1. Vehicle States 

2.1 Dynamic equations 

The evolution of $\eta$ is defined by the following kinematic equation:

$$\dot{\eta} = J(\eta_2)\,\nu \tag{6}$$

This equation defines the relationship between the velocities on both reference frames, with
the term $J(\eta_2)$ given as follows:

$$J_1(\eta_2) = \begin{bmatrix} \cos\theta\cos\psi & -\cos\phi\sin\psi + \sin\phi\sin\theta\cos\psi & \sin\phi\sin\psi + \cos\phi\sin\theta\cos\psi \\ \cos\theta\sin\psi & \cos\phi\cos\psi + \sin\phi\sin\theta\sin\psi & -\sin\phi\cos\psi + \cos\phi\sin\theta\sin\psi \\ -\sin\theta & \cos\theta\sin\phi & \cos\theta\cos\phi \end{bmatrix} \tag{7}$$

$$J_2(\eta_2) = \frac{1}{\cos\theta}\begin{bmatrix} \cos\theta & \sin\phi\sin\theta & \cos\phi\sin\theta \\ 0 & \cos\phi\cos\theta & -\sin\phi\cos\theta \\ 0 & \sin\phi & \cos\phi \end{bmatrix} \tag{8}$$

$$J(\eta_2) = \begin{bmatrix} J_1(\eta_2) & 0 \\ 0 & J_2(\eta_2) \end{bmatrix} \tag{9}$$



Note that there is a singularity in Eq. (8) when the pitch angle is ±90°. Underwater vehicles
typically do not operate close to this singularity. If we want to consider operation in that
zone, one possible alternative to the Euler angles is the quaternion representation. For
further details see, for instance, (Fossen, 1994).
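
The kinematic transformation can be sketched in Python as follows. This is a minimal illustration that assumes the standard ZYX (yaw, pitch, roll) Euler-angle convention of (Fossen, 1994); the function names `j1`, `j2` and `eta_dot` and the singularity threshold are hypothetical choices made for the example.

```python
import math


def j1(phi, theta, psi):
    """Rotation from body-fixed linear velocities to earth-fixed position rates (ZYX angles)."""
    cf, sf = math.cos(phi), math.sin(phi)
    ct, st = math.cos(theta), math.sin(theta)
    cp, sp = math.cos(psi), math.sin(psi)
    return [
        [cp * ct, -sp * cf + cp * st * sf, sp * sf + cp * cf * st],
        [sp * ct, cp * cf + sf * st * sp, -cp * sf + st * sp * cf],
        [-st, ct * sf, ct * cf],
    ]


def j2(phi, theta):
    """Map from body-fixed angular velocities to Euler-angle rates."""
    ct = math.cos(theta)
    if abs(ct) < 1e-9:  # illustrative threshold for the +/-90 degree pitch singularity
        raise ValueError("Euler-angle kinematics are singular at +/-90 degrees of pitch")
    cf, sf = math.cos(phi), math.sin(phi)
    tt = math.tan(theta)
    return [
        [1.0, sf * tt, cf * tt],
        [0.0, cf, -sf],
        [0.0, sf / ct, cf / ct],
    ]


def matvec(M, v):
    """Product of a 3 x 3 matrix (list of rows) and a 3-vector."""
    return [sum(m * x for m, x in zip(row, v)) for row in M]


def eta_dot(eta, nu):
    """Earth-fixed rates for eta = [x, y, z, phi, theta, psi] and nu = [u, v, w, p, q, r]."""
    phi, theta, psi = eta[3], eta[4], eta[5]
    return matvec(j1(phi, theta, psi), nu[0:3]) + matvec(j2(phi, theta), nu[3:6])
```

With all angles at zero the transformation is the identity, and with a yaw of 90° a pure surge velocity appears as motion towards East, which is a quick sanity check of the sign conventions.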

For most marine control applications it is valid to assume that the principle of superposition can
be applied when considering environmental disturbances. For underwater vehicles
operating far from the surface, the main environmental disturbance to be considered is due
to marine currents. Ocean and river currents arise due to a multitude of factors but, in
most operations, they have a low spatial and temporal rate of change with
respect to the vehicle dynamics. Therefore we can incorporate the effect of currents in the
vehicle model by considering that $\nu$ is relative to a frame moving with the water current
velocity $\nu_c$, where $\nu_c = [\,v_{cx},\, v_{cy},\, v_{cz},\, 0,\, 0,\, 0\,]^T$ is the earth-fixed current velocity vector. The only
change to the previously defined model is in the kinematic equation, which becomes:

$$\dot{\eta} = J(\eta_2)\,\nu + \nu_c \tag{10}$$

The equations of motion are composed of the standard terms for the motion of an ideal rigid 
body plus the terms due to hydrodynamic forces and moments and the terms due to 
propulsive forces. The usual approach to model the hydrodynamic terms consists of 
considering the following effects: restoring forces, the simplest one, which depends only on 
the vehicle weight, buoyancy and relative positions of the centers of gravity and buoyancy; 
added mass, which describes the pressure induced forces/moments due to forced harmonic 
motion of the body; drag, caused by skin friction (either laminar or turbulent) and vortex 
shedding; and lift, due to the fluid flow whose direction is changed perpendicularly to 
vehicle's linear velocity vector. As hinted in the introduction, added mass, drag and lift are 
very hard to describe accurately by theoretical methods. An extensive discussion about the 
added mass effect is presented in (Brennen, 1982). 

The drag effect is modelled as a force opposing the vehicle's velocity. The lift effect is
modelled as a force perpendicular to the vehicle's velocity. Both forces act on the center of
pressure of the vehicle and are a function of the vehicle's shape and of the square of its
velocity. The center of pressure is also strongly dependent on the vehicle's shape. For a more
in-depth analysis of this subject see, for instance, (Hoerner, 1992).

With the exception of the gravity and buoyancy forces, these effects are best described in the
Body-Fixed Frame. Therefore, the remaining equations of motion, describing the vehicle's
kinetics, can be presented in the following compact form:

$$M\dot{\nu} + C(\nu)\,\nu + D(\nu)\,\nu + g(\eta) = \tau_{act} \tag{11}$$

M is the constant inertia and added mass matrix of the vehicle, $C(\nu)$ is the Coriolis and
centripetal matrix, $D(\nu)$ is the damping matrix, $g(\eta)$ is the vector of restoring forces and
moments and $\tau_{act}$ is the vector of body-fixed forces from the actuators. We follow the
common formulation where the lift and drag terms are both accounted for in the damping
matrix.

For vehicles with a streamlined shape, theoretical and empirical formulas may be used. 
However, it must be remarked that in practice these vehicles are not quite as regular as 
assumed in the formulas usually employed for added mass, drag and lift: they have 
antennas, transducers and other protuberances that affect those effects, with special 
incidence on the drag terms. Therefore we should look at the formulas as giving 
underestimates of the true values of the coefficients. 

In certain situations it may be useful to consider the following simplifications: if the
vehicle's weight equals its buoyancy and the center of gravity is coincident with the center
of buoyancy, $g(\eta)$ is null; for an AUV with port/starboard, top/bottom and fore/aft
symmetries, M and $D(\nu) = D_1(\nu) + D_2(\nu)$ are diagonal. In the latter case, the damping matrix has
the following form:

$$D_1(\nu) = \mathrm{diag}(X_u,\, Y_v,\, Z_w,\, K_p,\, M_q,\, N_r) \tag{12}$$

$$D_2(\nu) = \mathrm{diag}(X_{u|u|}|u|,\; Y_{v|v|}|v|,\; Z_{w|w|}|w|,\; K_{p|p|}|p|,\; M_{q|q|}|q|,\; N_{r|r|}|r|) \tag{13}$$

For low velocities, the quadratic terms in Eq. (13), such as $Y_{v|v|}|v|$, may be considered
negligible. However, in practice, the fore/aft symmetry rarely holds and non-diagonal
terms should be considered. Even so, certain simplifications can be further considered. For
instance, in torpedo-shaped vehicles, some of the coefficients affecting the motion on the
vertical plane are the same as those affecting the motion on the horizontal plane, reducing
the number of different coefficients that must be estimated.
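
To illustrate how the linear and quadratic damping terms combine in the diagonal case, the Python sketch below evaluates the diagonal of the damping matrix and integrates a decoupled surge equation, neglecting the Coriolis and restoring terms. It is only a sketch under stated assumptions: the coefficients are treated as positive damping magnitudes, and the numerical values of mass, thrust and coefficients are hypothetical, not those of any real vehicle.

```python
def damping_diagonal(nu, linear, quadratic):
    """Diagonal of D(nu) = D1 + D2(nu): linear coefficient plus quadratic coefficient times |velocity|."""
    return [lin + quad * abs(v) for v, lin, quad in zip(nu, linear, quadratic)]


def simulate_surge(thrust, mass, x_u, x_uu, dt=0.01, duration=60.0):
    """Forward-Euler integration of mass*u_dot + (x_u + x_uu*|u|)*u = thrust, starting from rest.

    mass is assumed to include the added mass in surge; x_u and x_uu are
    positive linear and quadratic damping magnitudes (illustrative values).
    """
    u = 0.0
    for _ in range(int(round(duration / dt))):
        (d_surge,) = damping_diagonal([u], [x_u], [x_uu])
        u += dt * (thrust - d_surge * u) / mass
    return u
```

With the hypothetical values `thrust=10.0`, `mass=20.0`, `x_u=2.0` and `x_uu=8.0`, the steady surge speed solves 8u² + 2u = 10, i.e. u = 1 m/s, and at u = 0.01 m/s the quadratic term contributes only 0.08 against a linear term of 2, which shows why it may be neglected at low speed.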

Some of the models found in the literature, e.g. (Prestero, 2001; Leonard & Graver, 2001;
Conte & Serrani, 1996; Ridley et al., 2003), do not consider the linear damping terms
contained in $D_1(\nu)$. These terms may play an important role in the design of the control
system, namely in local stability analysis. In low-velocity scenarios the quadratic
damping terms become very small. If the linear damping is ignored, the linearization of the
system model around the equilibrium point may falsely reveal a locally unstable system.
This leads the control system designer to counteract by adding linear damping in the form
of velocity feedback, which could potentially be unnecessary, leading to conservative
designs. In fact, it is possible to find examples in the literature where the authors perform a
worst-case analysis by totally disregarding the damping matrix (Leonard, 1996; Chyba, 2003).
2.2 Actuators

In recent years there has been a trend in the research of biologically inspired actuators for
underwater vehicles, see for instance (Tangorra, 2007). The development of vehicles
employing variable buoyancy and center of mass (e.g., gliders) is also underway
(Bachmayer, 2004). However, the preferred types of actuators for small AUVs are still
electrically driven propellers and fins, due to their simplicity, robustness and low cost.

When high manoeuvrability is desired, full actuation is employed (for instance, with two
longitudinal thrusters, two lateral thrusters and two vertical thrusters). For over-actuated
vehicles, thruster allocation schemes may be applied in order to optimize performance and
power consumption. However, for a broad range of applications the cost effectiveness of
under-actuated vehicles is still a factor of preference. In those cases, a smaller number of
thrusters, possibly coupled with fins, is employed. This approach is applied in most
torpedo-like AUVs: there is a propeller for actuation in the longitudinal direction and fins
for lateral and vertical actuation. In this case, $\tau_{act}$ depends only on 3 parameters: propeller
velocity, horizontal fin inclination and vertical fin inclination.

Dynamic models for propellers can be found in (Fossen, 1994) and this is still an active area 
of research (D'Epagnier, 2006). However, the dynamics of the thruster motor and fin servos 
are generally faster than the remaining dynamics. Therefore, they can be frequently 
excluded from the model, namely when operation at steady speed is considered as opposed 
to dynamic positioning, or station keeping. 

2.3 Simplified models 

For a large class of underwater vehicles it is usual to consider decoupled modes of 
operation, see for instance (Healey & Lienard, 1993), the most common being motion on the 
horizontal plane, involving changes in x, y, $\psi$, v, and motion on the vertical plane aligned 
with the body fixed x-z axes, involving changes in z, $\theta$, w and q. In the latter mode, 
assuming small deviations of the pitch angle, a linearized model can be used 
without introducing significant error (the $a_{ij}$, $k_w$ and $k_q$ coefficients can be calculated as a 
function of the coefficients of the full nonlinear model): 
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(14) 



For the purpose of path planning on the horizontal plane with piecewise continuous velocity, a 
simple kinematic model can be used: 

$$\begin{aligned}
\dot{x} &= u\cos\psi - v\sin\psi + v_{cx} \\
\dot{y} &= u\sin\psi + v\cos\psi + v_{cy} \\
\dot{\psi} &= r
\end{aligned} \tag{15}$$



For the purpose of path planning, it is considered that the actuators produce the desired 
velocities instantaneously. The allowable ranges for u, v and r must be the same as the ones 
verified for the full dynamic model, or measured in real operation. While this model 
introduces some errors that must be compensated later by the on-line control system, it is 
very useful for general path planning algorithms. If the vehicle does not possess lateral 
actuation, such as a torpedo, the model drops the terms in v and becomes the well-known 
unicycle model. 
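
As an illustration, the kinematic model (15) can be integrated numerically with a few lines of Python. This is a minimal sketch, not the simulation code used by the authors: the forward-Euler scheme, the step size `dt`, the function names and the reading of the additive terms as constant water-current components `v_c = (v_cx, v_cy)` are all assumptions made here for the example.

```python
import math

def kinematic_step(state, u, v, r, v_c=(0.0, 0.0), dt=0.1):
    """One forward-Euler step of the horizontal-plane kinematic model (15).

    state = (x, y, psi); u, v, r are the commanded surge, sway and yaw rates.
    v_c is an assumed constant current (v_cx, v_cy) in the inertial frame.
    """
    x, y, psi = state
    x_dot = u * math.cos(psi) - v * math.sin(psi) + v_c[0]
    y_dot = u * math.sin(psi) + v * math.cos(psi) + v_c[1]
    psi_dot = r
    return (x + dt * x_dot, y + dt * y_dot, psi + dt * psi_dot)

def simulate(state, commands, v_c=(0.0, 0.0), dt=0.1):
    """Apply a sequence of (u, v, r) commands and return all visited states."""
    path = [state]
    for u, v, r in commands:
        state = kinematic_step(state, u, v, r, v_c, dt)
        path.append(state)
    return path

# Torpedo-like vehicle: no lateral actuation, so v = 0 (unicycle model).
path = simulate((0.0, 0.0, 0.0), [(1.0, 0.0, 0.0)] * 10)
```

With v fixed at zero the same function reproduces the unicycle model mentioned above; a path planner would additionally clip u, v and r to the allowable ranges of the vehicle.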

3. Results and discussion 

In (Silva et al., 2007) we describe a simulation environment which allows us to simulate 
AUV operation in real-time and with direct interaction with the control software. All 
software was written in C++ and is based on the Dune framework, also developed at the 
University of Porto. Using this framework, the control software and simulation engine may 
run either on a desktop computer or on the final target computer. Our results show that 
realistic real-time and faster than real-time simulation of underwater vehicles is quite 
feasible on today's computers. The trajectories obtained with the exact same inputs as those 
used in experiments in the water differ slightly from the real trajectories. However, as far as 
simulation of closed-loop operation is concerned, the feedback employed in the control laws 
smooths out the effects of parameter uncertainty. Therefore it is possible to observe a good 
correlation between the performance of the controlled system in simulation and that 
obtained in real operation. This conclusion is drawn using the exact same controllers and 
timings in simulation and real operation. This result is not as reassuring as a complete 
analytical proof but, then again, none of the currently employed models is a perfect 
description of reality; therefore, even an analytical study does not guarantee the 
planned behaviour when the respective implementation comes to real-life operation. 
The available methods are quite satisfactory for high level mission planning and already 
provide a good basis for initial controller tuning. However, additional tuning is still 
required when it comes to real-life vehicle operation. Research on models whose simulation 
can be done in reasonable time while providing an increasing level of adherence to reality 
should continue. 
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1. Introduction 

Recently there has been significant growth in the use of fuzzy logic in industrial and 
consumer products (J. Yen 1995). However, although fuzzy control has been successfully 
applied to many industrial plants that are mostly nonlinear systems, many critics of fuzzy 
logic claim that there is no such thing as a stability proof for fuzzy logic systems in closed- 
loop control (Reznik 1997; Farinwata, Filev et al. 2000). Since fuzzy logic controllers are 
classified as "non-linear multivariable controllers" (Reznik 1997; Farinwata, Filev et al. 
2000), it can be argued that all stability analysis methods applicable to these controller types 
are applicable to fuzzy logic controllers. Unfortunately, due to the complex non-linearities of 
most fuzzy logic systems, an analytical solution is not possible. Furthermore, it is important 
to realize that real, practical problems involve plants that cannot be modelled exactly, 
which results in substantial uncertainties. In addition, sensor noise 
and input signal level constraints affect system stability. Therefore a theory that is able to 
deal with these issues would be useful for practical designs. The most well-known time 
domain stability analysis methods include Lyapunov's indirect method (Wu & Ch. 2000; 
Gruyitch, Richard et al. 2004; Rubio & Yu 2007), which is based on linearization, and 
Lyapunov's direct method (Tanaka & Sugeno 1992; Giron-Sierra & Ortega 2002; Lin, 
Wang et al. 2007; Mannani & Talebi 2007), which uses a Lyapunov function that serves as a 
generalized energy function. In addition, many other methods have been used for testing 
fuzzy systems stability, such as Popov's stability criterion (Katoh, Yamashita et al. 1995; 
Wang & Lin 1998), the describing function method (Ying 1999; Aracil & Gordillo 2004), 
methods of stability indices and systems robustness (Fuh & Tung 1997; Espada & Barreiro 
1999; Zuo & Wang 2007), methods based on the theory of input/output stability (Kandel, Luo 
et al. 1999) and the conicity criterion (Cuesta & Ollero 2004). There are also methods based on 
hyper-stability theory (Piegat 1997) and the linguistic stability analysis approach (Gang & Laijiu 
1996). 

Fuzzy logic uses approximate reasoning and in this chapter a practical algorithm to improve 
system stability by using a fuzzy stabilizer block in the feedback path is introduced. The 
fuzzy stabilizer is tuned such that its nonlinearity lies in a bounded sector resulting from the 
circle criterion theory (Safonov 1980). The circle criterion provides a sufficient condition 
for absolute stability (Vidyasagar 1993). An appealing aspect of the circle criterion is its 
geometric nature, which is reminiscent of the Nyquist criterion. It is a frequency domain 
method for stability analysis and has been used by Ray et al. (1984) to ensure fuzzy system 
stability (Ray, Ghosh et al. 1984; Ray & Majumderr 1984). 
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Throughout this chapter we use a practical approach to stabilize fuzzy systems with the aid 
of the circle criterion theory using a Takagi-Sugeno fuzzy block in the feedback loop of the 
closed system. The new technique is used to ensure stability for the proposed robot fuzzy 
controller. Furthermore, the study indicates that the fuzzy stabilizer can be integrated, with 
minor modifications, into any fuzzy controller to enhance its stability. As a result, the 
proposed design is suitable for hardware implementation even permitting relatively simple 
modification of existing designs to improve system stability. In addition an extension to the 
approach to stabilize MIMO (Multi-input Multi-output) systems is also presented. 



2. Problem formulation and analysis 

This chapter concentrates on the stability of a closed loop nonlinear system using a Takagi- 
Sugeno (T-S) fuzzy controller. Fuzzy control based on Takagi-Sugeno (T-S) fuzzy model 
(Babuska, Roubos et al. 1998; Buckley & Eslami 2002) has been used widely in nonlinear 
systems because it efficiently represents a nonlinear system by a set of linear subsystems. 
The main feature of the T-S fuzzy model is that the consequents of the fuzzy rules are 
expressed as analytic functions. The choice of the function depends on its practical 
applications. Specifically, the T-S fuzzy model is an interpolation method, which can be 
utilized to describe a complex or nonlinear system that cannot be exactly modelled 
mathematically. The physical complex system is assumed to exhibit explicit linear or 
nonlinear dynamics around some operating points. These local models are smoothly 
aggregated via fuzzy inferences, which lead to the construction of complete system 
dynamics. 

A Takagi-Sugeno (T-S) fuzzy controller is used in the feedback path as shown in Fig. 1, so that 
it can change the amount of feedback in order to enhance the system performance and its 
stability. 




Fig. 1 The proposed System block diagram 

The proposed fuzzy controller is a two-input one-output system: the error e(t) and the 
output y(t) are the controller inputs while the output is the feedback signal $\varphi(t)$. The fuzzy 
controller uses symmetric, normal and uniformly distributed membership functions for the 
rule premises as shown in Fig. 2(a) and 2(b). Labels such as NBig (Negative Big) and PBig 
(Positive Big) have been assigned to every membership function. Notice that the widths of 
the membership functions of the inputs are parameterized by L and h, which are used to tune 
the controller and are limited by the physical limitations of the controlled system. 
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Fig. 2 (a) The membership distribution of the 2nd input, open loop output y(t) 
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Fig. 2 (b) The membership distribution of the 1st input, the error e(t) 

When using the T-S fuzzy model (Buckley & Eslami 2002), the consequents of the fuzzy 
rules are expressed as analytic functions which are linearly dependent on the inputs. In the 
present case, three singleton fuzzy terms are assigned to the output such that the consequent 
part of the i-th rule, $\varphi_c^i$, is a linear function of one input y(t), which can be expressed as: 

$$\varphi_c^i(t) = r_i\, M\, y(t) \tag{1}$$

where 

$r_i$ takes the values -1, 0, 1 (depending on the output's fuzzy terms) 

y(t) is the 2nd input to the controller 

M is a parameter used to tune the controller. 

The fuzzy rules are formulated such that the output is a feedback signal inversely 
proportional to the error signal as follows: 

IF the error is High THEN $\varphi_c^1 = M\,y(t)$ 

IF the error is Normal THEN $\varphi_c^2 = 0$ 

IF the error is Low THEN $\varphi_c^3 = -M\,y(t)$ 

The fuzzy controller is adjusted by changing the values of L, h and M, which affect the 
controller nonlinearity map. These values therefore play a role 
equivalent to the saturation parameters of a standard saturation nonlinearity (Jenkins & 
Passino 1999). 

Before studying the system stability, a general model of a Sugeno fuzzy controller is defined 
(Thathachar & Viswanath 1997; Babuska, Roubos et al. 1998; Buckley & Eslami 2002) as 
follows. 

For a two-input T-S fuzzy system, let the system state vector at time t be: 

$$z = \begin{bmatrix} z_1 \\ z_2 \end{bmatrix}$$

where $z_1$ and $z_2$ are the state variables of the system at time t. 
A T-S fuzzy system is defined by the implications such that: 

$$R^i:\ \text{if } (z_1 \text{ is } S_1^i \text{ AND } z_2 \text{ is } S_2^i) \text{ then } \dot{z} = A_n z + B_n$$

and for the proposed system, where $B_n$ is taken as a zero matrix and n = 2 for the two-input 
system: 

$$R^i:\ \text{if } (z_1 \text{ is } S_1^i \text{ AND } z_2 \text{ is } S_2^i) \text{ then } \dot{z} = A_1 z_1 + A_2 z_2, \qquad i = 1, \dots, N,$$

where $S_1^i$, $S_2^i$ are the fuzzy sets corresponding to the state variables $z_1$, $z_2$ in rule $R^i$, and 
$A_n = [A_1, A_2]$ are the characteristic matrices which represent the fuzzy system. 
The truth value or weight of the implication $R^i$ at time t, denoted by $w_i(z)$, is defined 
as: 

$$w_i(z) = \mu_{S_1^i}(z_1) \wedge \mu_{S_2^i}(z_2)$$

where 

$\mu_S(z)$ is the membership function value of fuzzy set S at position z 

$\wedge$ is taken to be the min operator 

Then the system state is updated according to (Reznik 1997): 

$$\dot{z} = \frac{\sum_{i=1}^{N} w_i(z)\, A_i\, z}{\sum_{i=1}^{N} w_i(z)} = \sum_{i=1}^{N} \delta_i(z)\, A_i\, z \tag{2}$$

where 

$$\delta_i(z) = \frac{w_i(z)}{\sum_{p=1}^{N} w_p(z)}$$
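
The weighted-average inference of Eq. 2 can be sketched in a few lines of Python. The sketch below only illustrates the mechanics (min operator for the rule weights, normalisation, weighted sum of the local linear models); the triangular membership functions, the two example rules and all function names are hypothetical and are not taken from the chapter.

```python
import numpy as np

def triangle(centre, half_width):
    """Return a triangular membership function (an assumed shape)."""
    def mu(x):
        return max(0.0, 1.0 - abs(x - centre) / half_width)
    return mu

def ts_state_derivative(z, rules):
    """Evaluate Eq. 2 for rules given as (mu_1, mu_2, A_i) triples."""
    z = np.asarray(z, dtype=float)
    # Rule weights w_i(z): min of the two premise membership values.
    w = np.array([min(mu_1(z[0]), mu_2(z[1])) for mu_1, mu_2, _ in rules])
    total = w.sum()
    if total == 0.0:
        raise ValueError("no rule fires for this state")
    delta = w / total  # normalised weights delta_i(z)
    z_dot = sum(d * (A @ z) for d, (_, _, A) in zip(delta, rules))
    return z_dot, delta

# Hypothetical two-rule system used only to exercise the function.
rules = [
    (triangle(-1.0, 2.0), triangle(0.0, 2.0), -1.0 * np.eye(2)),
    (triangle(1.0, 2.0), triangle(0.0, 2.0), -2.0 * np.eye(2)),
]
z_dot, delta = ts_state_derivative([0.5, 0.0], rules)
```

Because the weights are normalised, the result is always a convex combination of the local models, which is what allows the later split into a linear part and a bounded nonlinear part.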

However, the consequent part of the proposed system rules is a linear function of only one 
input y(t), as mentioned in the previous section, and therefore the output of the fuzzy 
controller, $\varphi_c(t)$, is of the form: 

$$\varphi_c(t) = \sum_{i=1}^{N} \delta_i\, M_i\, y(t) \tag{3}$$

where N is the number of rules 

$M_i$ is a parameter used for the i-th rule to tune the controller 

Notice that Eq. 3 depends directly on the input y(t) and indirectly on e(t), which 
affects the weights $\delta_i$. Thus the proposed system can be redrawn as shown in Fig. 3. 



Fig. 3 The equivalent block diagram of the proposed system (input x(t), plant G, fuzzy controller in the feedback path) 
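
The three-rule stabilizer described above can be sketched as follows. The rule base (High, Normal, Low error with consequents M y, 0 and -M y) and the weighted average of Eq. 3 come from the text; the saturating triangular membership shapes and the single `width` parameter for the error input are assumptions made for this example, since the membership figures are not reproduced here and the exact roles of L and h cannot be recovered from the text alone.

```python
def error_memberships(e, width):
    """Assumed memberships of the error e: returns (low, normal, high).

    The error is normalised by `width` and saturated to [-1, 1], so the
    three degrees always sum to one.
    """
    s = max(-1.0, min(1.0, e / width))
    low = max(0.0, -s)
    high = max(0.0, s)
    normal = 1.0 - abs(s)
    return low, normal, high

def stabilizer_output(e, y, M, width):
    """Weighted average of the rule consequents r_i * M * y (Eq. 1 and 3)."""
    low, normal, high = error_memberships(e, width)
    weights = (high, normal, low)      # rule order: High, Normal, Low
    r = (1.0, 0.0, -1.0)               # consequent signs r_i
    total = sum(weights)
    return sum(w / total * r_i * M * y for w, r_i in zip(weights, r))

# Example call with illustrative values only.
phi = stabilizer_output(e=10.0, y=2.0, M=0.68, width=20.0)
```

Under these assumed memberships the output reduces to a saturated, error-scaled gain on y(t), which is consistent with the comparison to a standard saturation nonlinearity made in the text.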

The stability analysis of the system considers the system nonlinearities and uses circle 
criterion theory to ensure stability. 

3. Stability analysis using circle criterion 

In this section the circle criterion (Ray, Ghosh et al. 1984; Ray & Majumderr 1984; 
Vidyasagar 1993; Jenkins & Passino 1999) will be used for testing and tuning the controller 
in order to ensure the system stability and improve its output response. The circle criterion 
was first used in (Ray, Ghosh et al. 1984; Ray & Majumderr 1984) for stability analysis of 
fuzzy logic controllers and, as a result of its graphical nature, the designer is given a physical 
feel for the system. 
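The output of the system given by Eq. 3, $\varphi_c(t)$, can be rewritten as follows: 

$$\varphi_c(t) = \sum_{i=1}^{N} M_i\, y(t) - \sum_{i=1}^{N} \bigl(1 - \delta_i(t)\bigr)\, M_i\, y(t) \tag{4}$$

This comprises a separate linear part and a nonlinear part, denoted as $\varphi(t)$, that can be 
expressed by (Vidyasagar 1993; Cuesta, Gordillo et al. 1999): 

$$\varphi(t) = \sum_{i=1}^{N} \bigl[\bigl(1 - \delta_i(t)\bigr)\, M_i\, y(t)\bigr] \tag{5}$$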

As a result a T-S fuzzy system can be represented as a Lur'e system (Vidyasagar 
1993; Cuesta, Gordillo et al. 1999). Consider a closed loop system, Fig. 4, consisting of a linear 
time-invariant part G (a linear representation of the process to be controlled) with a nonlinear 
feedback part $\varphi(t)$ (representing a fuzzy controller). 
The function $\varphi$ represents a memoryless, time-varying nonlinearity with: 

$$\varphi : [0, \infty) \times \mathbb{R} \to \mathbb{R}$$

Fig. 4 T-S fuzzy system according to the structure of the Lur'e problem 

If $\varphi$ is bounded within a certain region as shown in Fig. 5, such that there exist 
$\alpha$, $\beta$, a, b ($\beta > \alpha$, $a < 0 < b$) for which: 

$$\alpha y \le \varphi(y) \le \beta y \tag{6}$$

for all $t \ge 0$ and all $y \in [a, b]$, then $\varphi(y)$ is a "sector nonlinearity". 

Fig. 5 Sector bounded nonlinearity 

If $\alpha y \le \varphi(y) \le \beta y$ holds for all $y \in (-\infty, \infty)$, then the sector condition holds globally. The 
system is "absolutely stable" if it is stable for every nonlinearity satisfying this condition. The idea is that no 
detailed information about the nonlinearity is assumed; all that is known is that $\varphi$ satisfies this 
condition (Vidyasagar 1993). 

Let $D(\alpha, \beta)$ denote the closed disk in the complex plane centred at $-\frac{\alpha + \beta}{2\alpha\beta}$, with radius 
$\frac{|\alpha - \beta|}{2|\alpha\beta|}$, so that its diameter is the line segment connecting the points $-\frac{1}{\alpha} + j0$ and $-\frac{1}{\beta} + j0$. 

The circle criterion states that when $\varphi$ satisfies the sector condition Eq. 6, the system in Fig. 3 
is absolutely stable if one of the following conditions is met (Vidyasagar 1993): 

• If $0 < \alpha < \beta$, the Nyquist plot of $G(j\omega)$ is bounded away from the disk $D(\alpha, \beta)$ and 
encircles it m times in the counter clockwise direction, where m is the number of poles 
of G(s) in the open right half plane (RHP). 

• If $0 = \alpha < \beta$, G(s) is Hurwitz (poles in the open LHP) and the Nyquist plot of $G(j\omega)$ lies 
to the right of the line $\mathrm{Re}(s) = -\frac{1}{\beta}$. 

• If $\alpha < 0 < \beta$, G(s) is Hurwitz and the Nyquist plot of $G(j\omega)$ lies in the interior of the disk 
$D(\alpha, \beta)$ and is bounded away from the circumference of $D(\alpha, \beta)$. 
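
The geometry of the disk used in these conditions is easy to compute. The short Python sketch below derives the centre and radius of the disk from the two diameter endpoints -1/α and -1/β; the function name is hypothetical, and the case α = 0 (where the disk degenerates to a half-plane) is deliberately rejected rather than handled.

```python
def circle_criterion_disk(alpha, beta):
    """Return (centre, radius) of the disk D(alpha, beta) on the real axis.

    The diameter is the segment between -1/alpha and -1/beta, so this is
    only defined for nonzero sector bounds.
    """
    if alpha == 0 or beta == 0:
        raise ValueError("disk degenerates when a sector bound is zero")
    p_alpha = -1.0 / alpha
    p_beta = -1.0 / beta
    centre = 0.5 * (p_alpha + p_beta)
    radius = 0.5 * abs(p_alpha - p_beta)
    return centre, radius

centre, radius = circle_criterion_disk(1.0, 2.0)
```

For α = 1 and β = 2 the endpoints are -1 and -0.5, which gives a disk centred at -0.75 with radius 0.25, in agreement with the closed-form centre and radius quoted above.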

For the fuzzy controller represented by Eq. 2, we are interested in the first two conditions 
(Ray & Majumderr 1984), and it can be sector bounded in the same manner (Jenkins & 
Passino 1999) as described next. 

Consider the fuzzy controller as a nonlinearity $\varphi$ and assume that there exists a sector $(\alpha, \beta)$ 
in which $\varphi$ lies, then use the circle criterion to test the stability. Since the lower slope $\alpha$ of the 
fuzzy logic controller's sector is always zero (Jenkins & Passino 1999), the disk degenerates on the 
Nyquist plot to the straight line passing through $-\frac{1}{\beta}$ and parallel to the imaginary axis, as shown 
in Fig. 6. In such a case the stability criterion is modified as follows (Vidyasagar 1993): 

Definition: A single-input single-output (SISO) system will be globally and asymptotically 
stable provided the complete Nyquist locus of its transfer function does not enter the 
forbidden region to the left of the line passing through $-\frac{1}{\beta}$ in an anticlockwise direction, as shown 
in Fig. 6. 

The fuzzy controller is tuned until its parameters lie in the bounded sector, so that the fuzzy 
system nonlinearity is bounded in this sector. In fact, even if the function $\varphi$ is approximately 
linear, the saturation outside this region causes $\varphi$ to be always nonlinear. 

From the above discussion, we conclude that to ensure stability for a closed loop system 
with known transfer function or nonlinearity sector, one can add a fuzzy block (stabilizer) in 
the feedback loop, tuned in the manner described above, under the condition that the 
stabilizer block is faster than the controlled system. This concept is used to enhance the 
performance of existing control systems, especially for systems controlled using a fuzzy 
controller in the forward loop. In such cases the feedback fuzzy stabilizer can be integrated 
into the main fuzzy controller as explained in the next section. 
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Fig. 6 Nyquist plot with fuzzy feedback system (Ray & Majumderr 1984). 



4. Self stabilized fuzzy controller 

Figure 7 comprises a plant controlled by a SISO fuzzy controller. In order to guarantee the 
system stability, a fuzzy stabilizer has been added in the feedback path. 
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Fig. 7 Block Diagram of the system with Fuzzy-P controller 

Only minor changes to the above analysis are necessary in order to include the SISO fuzzy 
controller nonlinearities if these have not been included in the previously calculated sector. 
As a result, the fuzzy stabilizer will be retuned to the new sector, which will be the minimum 
intersection between the fuzzy controller nonlinearity sector and the sector resulting from the 
circle criterion. This is understandable as the fuzzy controller represents an odd function 
(Reznik 1997; Jenkins & Passino 1999) (i.e. $\varphi(-y) = -\varphi(y)$), so that the fuzzy controller can be in 
the feedback path rather than the feed forward path. Therefore, the dominant nonlinear 
sector will be the minimum sector. Consequently, the feedback stabilizer can 
be built into each fuzzy controller to improve its performance by adding an extra input and 
modifying the original fuzzy rule base by adding the stabilization rules. 

Generally, there are many types of fuzzy reasoning that can be employed in fuzzy control 
applications; the most commonly used types are Mamdani and Takagi-Sugeno (T-S). 
For Mamdani fuzzy systems (Farinwata, Filev et al. 2000), the same structure can be used 
except for the addition of another input y(t) and three extra rules to the rule base as shown 
in Fig. 8. 

Fig. 8 The modification to the fuzzy system structure 

where $\mu_x$, $\mu_c$ are the input and output fuzzy sets for the Mamdani fuzzy system and 
$\mu_y$, $\mu_a$ are the input and output fuzzy sets for the fuzzy stabilizer system. 
Consequently, less modification is required for T-S type fuzzy systems. 

The main reason for integrating the stabilizer into the normal structure of fuzzy controllers 
is to make them suitable for hardware and software implementation. The same design of the 
circuits or algorithms will be used without significant modifications. 

5. Examples and simulation results 

A plant with transfer function: 

$$G(s) = \frac{400}{s^3 + 10.4 s^2 + 8 s + 40}$$

is used to demonstrate the performance of the fuzzy stabilizer. The Nyquist plot of $G(j\omega)$ is 
shown in Fig. 9. 

The system is unstable and has closed loop poles at -12.6 and 1.08 ± j5.82, with a gain margin 
of -19.3 dB. If we consider the fuzzy stabilizer as a nonlinearity $\varphi$ as shown in Fig. 5, then the 
disk $D(\alpha, \beta)$ has as its diameter the line segment connecting the points $-\frac{1}{\alpha} + j0$ and $-\frac{1}{\beta} + j0$. Applying the 
circle criterion, and because $\alpha = 0$, the second condition will be used. To find a sector $(\alpha, \beta)$ 
in which $\varphi$ lies, the system Nyquist plot in Fig. 9 is analyzed. The Nyquist plot does not satisfy 
the second condition as it intersects with the line drawn at $-\frac{1}{\beta} = -9.259$. In order to meet the 
second condition of the theory, the line drawn at $-\frac{1}{\beta}$ will be moved to $-\frac{1}{\beta} = -27.5$ such 
that the Nyquist plot lies to the right of it. As a result, the fuzzy controller will be tuned by 
choosing M and L such that its nonlinearity lies in the sector (0, 0.036). 



Fig. 9 The plant Nyquist plot (Nyquist diagram; horizontal axis: real axis) 

In order to satisfy the circle criterion condition, the ratio M/L will be kept less than $\beta$ (i.e. 
M/L < 0.036) by choosing M = 0.68 and L = 20. 
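
The numbers quoted in this example can be checked numerically. The following NumPy sketch assumes the plant is G(s) = 400 / (s^3 + 10.4 s^2 + 8 s + 40); the s-coefficient 8 is an assumption, chosen because it reproduces the quoted closed loop poles, the real-axis crossing at -9.259 and the gain margin of -19.3 dB. The frequency grid and variable names are also choices made for this illustration, not part of the original procedure.

```python
import numpy as np

num = [400.0]
den = [1.0, 10.4, 8.0, 40.0]  # s^3 + 10.4 s^2 + 8 s + 40 (s-coefficient assumed)

# Closed loop poles under unity negative feedback: roots of den + num.
closed_loop_poles = np.roots(np.polyadd(den, num))

# Real-axis crossing: Im(den(jw)) = w (8 - w^2) vanishes at w^2 = 8.
s_pc = 1j * np.sqrt(8.0)
crossing = np.polyval(num, s_pc) / np.polyval(den, s_pc)
gain_margin_db = 20.0 * np.log10(1.0 / abs(crossing))

# Leftmost point of the Nyquist locus on a dense frequency grid.
w = np.linspace(0.01, 50.0, 200001)
G = np.polyval(num, 1j * w) / np.polyval(den, 1j * w)
min_real = G.real.min()

# Largest beta for which the locus stays to the right of the line -1/beta.
beta_max = -1.0 / min_real

# Tuning used in the text.
M, L = 0.68, 20.0
sector_ok = (M / L) < beta_max
```

Under this assumption the leftmost point of the locus lies just to the right of -27.5, so the bound on β is about 0.036 and the chosen ratio M/L = 0.034 satisfies it.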

A traditional fuzzy proportional-like controller (Reznik 1997) is used to control the system 
with a normal feedback loop, as shown in Fig. 7, so that a comparison can be made 
between the results with and without a fuzzy stabilizer in the feedback loop. In order to 
retune the fuzzy stabilizer, note that the fuzzy P-controller has a ratio $M_c/L_c$, or $\beta_c$, equal to 1. 
However, $\beta = 0.036$ for the plant, and therefore the minimum sector for the stabilizer to be 
tuned to is $(\alpha, \beta) = (0, 0.036)$. 

The system step responses with the stabilizer (solid line) and without it 
(dashed line) are shown in Fig. 10. The results show that the system with the fuzzy 
P-controller in Fig. 7 yields an unstable output (dashed line) while the use of the stabilizer 
produces a stable output. 

The approach described provides a quick and easy stabilization process which 
allows designers to fine tune their controller's performance without, at the same time, being 
worried about stability issues. 
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In Fig. 11 (a), and (b), the step responses for different systems, according to the setup in Fig. 
3, are shown. The simulations show the tested system with normal feedback without the 
stabilizer and with the stabilizer added in the feedback loop as illustrated in Fig. 3. 
In each case the stabilizer has been tuned with the same algorithm, given the transfer function, 
the nonlinearity sector and the resulting values of M and L of the fuzzy stabilizer. 



Fig. 10 The simulated step response of the two compared systems (solid line: stabilized output signal; dashed line: unstabilized output signal) 




Fig. 11(a) The step response of the controller with the following parameters (black curve): 
G(s) with denominator $s^3 + 7s^2 + 7s + 15$, $(\alpha, \beta) = (0, 0.3)$, L = 1, M = 0.3 
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Fig. 11(b) The step Response of the controller with following parameters (Black curve): 
G(s) = …, $(\alpha, \beta) = (0, 1.3)$, L = 1, M = 1.3 
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6. Extension to MIMO fuzzy systems 

The stability analysis of multi-input multi-output (MIMO) systems is a nontrivial task due to the 
complexity of the system (Safonov 1980); however, many algorithms have been proposed to 
tackle the problem. K. Ray and D. Majumder (Ray & Majumderr 1984) extended their 
approach of using the circle criterion to MIMO systems but restricted the result to square systems 
only. The conicity theory has been used by others (Kang, Kwon et al. 1998; Cuesta, Gordillo 
et al. 1999; Cuesta & Ollero 2004) to study the stability of MIMO fuzzy systems but it suffers 
from the nontrivial problem of determining the candidate centre. The linear matrix inequalities 
(LMI) technique is also used (Wang, Tanaka et al. 1996; Lam & Seneviratne 2007) but has the 
disadvantage of requiring a high number of LMIs, which makes the analysis more complicated 
(Cuesta, Gordillo et al. 1999). The describing function is also used to study the stability of 
MIMO systems (Abdelnour, Cheung et al. 1993; Aracil & Gordillo 2004). 

6.1 Stability analysis of open loop MIMO systems 

In order to extend the proposed fuzzy stabilizer approach to MIMO (multi-input 
multi-output) systems, an additive decomposition technique (Ying 1996) is used. According to 
the structure of the classical Lur'e problem (Vidyasagar 1993; Cuesta, Gordillo et al. 
1999) shown in Fig. 4, and referring to the analysis in section 3, a T-S fuzzy system can be 
represented as a linear and a nonlinear part as follows. 
Consider a T-S fuzzy system with N rules (Cuesta, Gordillo et al. 1999): 

$$R^{ij}:\ \text{if } (z_1 \text{ is } S_1^i \text{ AND } z_2 \text{ is } S_2^j) \text{ then } \dot{z} = M_{ij}\, z$$

with: 

$$M_{ij} = \begin{bmatrix} a_i & b_j \\ c_i & d_j \end{bmatrix} \quad \text{and} \quad z = \begin{bmatrix} z_1 \\ z_2 \end{bmatrix}$$

for the rules r = 1, …, m × n, where 
$S_1^i$, $S_2^j$ are the fuzzy sets corresponding to the state variables 
$z_1$, $z_2$, which are represented by linguistic terms with membership functions such that: 

$$\mu_{S_1^p}(z_1 = 0) = 1, \qquad \mu_{S_1^i}(z_1 = 0) = 0, \quad i \ne p,\ i = 1, \dots, m$$

and 

$$\mu_{S_2^q}(z_2 = 0) = 1, \qquad \mu_{S_2^j}(z_2 = 0) = 0, \quad j \ne q,\ j = 1, \dots, n$$

and $M_{ij} \in \mathbb{R}^{2 \times 2}$ are the characteristic matrices which represent the fuzzy system. 
Similar to the analysis in section 2, the system state is updated according to: 

$$\dot{z} = \sum_{i=1}^{m} \sum_{j=1}^{n} \delta_{ij}(z)\, M_{ij}\, z \tag{7}$$

where 

$$\delta_{ij}(z) = \frac{w_{ij}(z)}{\sum_{k=1}^{m} \sum_{l=1}^{n} w_{kl}(z)}$$

and $w_{ij}(z)$ is the truth value or weight of the implication $R^{ij}$ at time t. 
Then Eq. 7 can be rewritten as: 

$$\dot{z} = \sum_{i=1}^{m} \sum_{j=1}^{n} \Bigl[ M_{ij}\, z - \bigl(1 - \delta_{ij}(z)\bigr)\, M_{ij}\, z \Bigr] \tag{8}$$

Eq. 8 shows that the system has been split into a linear and a nonlinear part, Fig. 4. Notice that the 
first column of $M_{ij}$ depends on i while the second column depends on j. Hence, the resulting 
nonlinear part $\varphi(z)$, given by 

$$\varphi(z) = \sum_{i=1}^{m} \sum_{j=1}^{n} \bigl(1 - \delta_{ij}(z)\bigr)\, M_{ij}\, z \tag{9}$$

is additively decomposable (Cuesta, Gordillo et al. 1999), that is: 

$$\varphi(z) = \varphi(z_1, z_2) = \varphi(z_1, 0) + \varphi(0, z_2) \tag{10}$$

(see (Cuesta, Gordillo et al. 1999) for the proof). 

Eq. 10 implies that the nonlinear part $\varphi$ is additively decomposable, and therefore 
techniques used for stability analysis of SISO systems can be used to stabilize multi-input 
multi-output systems. This can be done by adding a number of small fuzzy systems equal to 
the number of the output variables in the feedback loop of the MIMO system for each input 
variable, as shown in Fig. 12. In this way all the nonlinearities of the fuzzy system can be 
included within a bounded sector. 




Fig. 12 The proposed MIMO fuzzy feedback system 

6.2 Stability analysis of closed loop MIMO system 

A simple stability analysis for closed loop system is shown in Fig. 13 (a). In this system the 
proposed fuzzy stabiliser is placed on each feedback loop for each input as shown in Fig. 
13(b). That includes all the nonlinearities of the system. 
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Fig. 13. (a) MIMO closed loop system 
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Fig. 13. (b) MIMO closed loop system with fuzzy stabilizers 

6.3 Simulation example 

Consider a MIMO system with a state space representation: 
In our problem we will find a transfer function of the model of the form: 
Using the analysis described in section 5 and with the aid of the Nyquist plot of the system shown 
in Fig. 14, we can determine $-\frac{1}{\beta} = -6.45$; as a result, M/L < 0.155. 

Note that, for all the components of the system (G11, G12, G21, and G22), the denominator 
in each case remains the same, since it holds the key to the system stability. 
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Fig. 14 The Nyquist plot of the simulated system 

The outputs of the open loop system show the system instability as shown in Fig. 15. 

Fig. 15 The open loop response of the simulated system 

When the fuzzy stabilizers are added to the system according to Fig. 12, the fuzzy 
parameters are set such that the ratio M/L < 0.155 is kept, as follows: 

Stabilizer (1) M11 = 3.1, L11 = 25 

Stabilizer (2) M12 = 1.8, L12 = 12 

Stabilizer (3) M21 = 0.031, L21 = 0.25 

Stabilizer (4) M22 = 0.018, L22 = 0.12 
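
A quick arithmetic check of these settings against the sector bound can be written as below. The bound β = 1/6.45 follows from the Nyquist reading of -1/β = -6.45 given earlier in this section; the dictionary layout and names are just a convenience for this sketch.

```python
# Sector bound from the Nyquist plot: -1/beta = -6.45.
beta_max = 1.0 / 6.45

# (M, L) pairs of the four stabilizers listed above.
stabilizers = {
    "11": (3.1, 25.0),
    "12": (1.8, 12.0),
    "21": (0.031, 0.25),
    "22": (0.018, 0.12),
}

ratios = {name: M / L for name, (M, L) in stabilizers.items()}
all_inside = all(ratio < beta_max for ratio in ratios.values())
```

The four ratios come out as roughly 0.124 and 0.15, all below the bound of about 0.155, so each stabilizer nonlinearity stays inside the sector.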

The simulation results in Fig. 16 show the output of the stabilized system. 
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Fig. 16 The outputs of the stabilized simulated system 

The proposed technique has the advantage of keeping the system stable even if the system 
nonlinearities have been changed provided that they still remain within the bounded sector 
proposed. 



7. Conclusion 

This chapter presented a practical approach to stabilizing fuzzy systems based on adaptive 
nonlinear feedback using a fuzzy stabilizer in the feedback loop. For this we needed to 
identify the nonlinearity range of the system. The fuzzy stabilizer is tuned so that the system 
nonlinearities lie in a bounded sector derived from the circle criterion. 
Because of the circle criterion's graphical nature, the designer is given a physical feel for the 
system. The concept has been used to ensure stability of a car-like robot controller. In 
addition, the idea has been extended to stabilize MIMO systems based on the additive 
decomposition technique. 

The advantage of the proposed approach is the simplicity of the design procedure, especially 
for MIMO systems analysis and implementation. The use of the fuzzy system to control 
the feedback loop using its approximate reasoning algorithm gives a good opportunity to 
handle practical system uncertainty. The approach described provides a quick and 
easy stabilization process which allows designers to fine tune their controller's 
performance without, at the same time, worrying about stability issues. It is also shown that 
the fuzzy stabilizer can be integrated, with small modifications, into any fuzzy controller to 
enhance its stability. As a result it is suitable for hardware implementation, or even for 
modifying existing software and hardware designs if required to ensure system stability. 
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1. Introduction 

Recently there has been increased research interest in the study of hybrid dynamical 
systems (Sun & Ge, 2005) and (Li et al., 2005). These systems involve the interaction of 
discrete and continuous dynamics. Continuous variables take values from the set of real 
numbers and discrete variables take values from a finite set of symbols. Hybrid 
systems behave as an analog dynamic system until certain abrupt structural or 
operating conditions change. The event driven dynamics in hybrid control systems can 
be described using different frameworks from discrete event systems (Cassandras & 
Lafortune, 2008) such as timed automata, max-plus algebra or Petri nets. For dynamic 
systems whose components are predominantly discrete event, the main tools for analysis and design 
are representation theory, supervisory control, computer simulation and verification. From 
the classical control theory point of view, hybrid systems may be considered as switching 
control between analog feedback loops. Generally, hybrid systems can achieve better 
performance than non-switching controllers because they can reconfigure and reorganize 
their structures. This requires correct coordination of the discrete and analog control 
variables. 

The mathematical model of a real process generally has the Hammerstein-Wiener form 
(Crama & Atkins, 2001) and (Zhao & Chen, 2006). This means that nonlinear elements 
(actuator and sensor) are present at the input and output of the process. Here we will consider the 
Hammerstein model, which has input saturation as its nonlinear element. That is the most 
frequent nonlinearity encountered in practice (Hippe, 2006). Unmodeled dynamics 
satisfying the matching condition are also present. Switching control will be used as the control strategy. 
Switched systems can be viewed as a higher abstraction of hybrid systems. 
The design of switching controllers with guaranteed stability, known as piecewise linear
LQ control (PLC), was first considered in (Wredenhagen & Belanger, 1994). Piecewise
linear systems are systems that have different linear dynamics in different regions of the
continuous state space (Johansson, 2003). PLC control has associated switching
surfaces in the form of positively invariant sets and yields a relatively low-gain controller.
In the LHG (low-and-high gain) design, a low-gain feedback law is first designed in such a
way that the actuator does not saturate in magnitude and the closed-loop system remains
linear. The low gain enlarges the region in which the closed-loop system remains linear and
enlarges the basin of attraction of the closed-loop system (Lin, 1999). After that, using an
appropriate Lyapunov function for the closed-loop system under this low-gain feedback
control law, a linear high-gain feedback law is designed and added to the low-gain feedback
control. The combination of LHG and PLC gives robust controllers with fast transients. The
key feature of PLC/LHG controllers is that saturation is avoided. However, it has been
recognized in (Lin et al., 1997) and (De Dona et al., 2002) that the performance of the
closed-loop system can be further improved by forcing the control into saturation. Such a
controller enlarges the switching regions so that each linear controller is able to
act in a region where a degree of over-saturation is reached. Over-saturation means that
the input level demanded by the controller is greater than the available range.
Actuator rate saturation is also an important problem. Namely, the phase lag associated
with rate saturation has a destabilizing effect (Saberi et al., 2000). The problem is more
severe when the actuator is also subject to magnitude saturation, since a small actuator
output results in a small stability margin even in the absence of rate saturation (Lin et al.,
1997).

The problem is more complex in the presence of delay in the system. The paper
(Tarbouriech & da Silva, 2000) deals with the synthesis of stabilizing controllers for linear
systems with state delay and saturating controls. A performance-guided hybrid LQ controller
for discrete time-delay systems is considered in (Filipovic, 2005). In (Wu et al., 2007), a
method is considered for designing an output feedback law that stabilizes a linear system
subject to actuator saturation with a large domain of attraction. It is usually true that
higher performance levels are associated with pushing the limits (Goodwin et al., 2005).
This is the motivation for operating the system on the constraint boundaries. It means that
the problem of actuator saturation can be considered as optimisation with constraints.
In this paper, the robustness of piecewise linear LQ control with prescribed degree of
stability using switching, low-and-high gain and over-saturation is considered. The process is
described by a linear uncertain dynamic system in state space form. The structure of the
uncertainties is defined by matching conditions. Using the Lyapunov stability criterion
(Michel et al., 2008), it is shown that a robust PLC/LHG controller with allowed
over-saturation can exponentially stabilize linear uncertain systems with a prescribed
exponential rate. This approach differs from (De Dona et al., 2002), where the Riccati
equation approach is used.

2. Switching controller with prescribed degree of stability 

The dynamic system subject to input saturation can be described in the following form

$$\dot{x}(t) = A x(t) + B\,\mathrm{sat}_\Delta(u(t)), \quad x(t_0) = x_0 \in X \subset R^n \tag{1}$$

where $x \in R^n$ and $u \in R^m$. The nonlinear input function (saturation function) is defined as

$$\mathrm{sat}_\Delta : R^m \to R^m, \quad \Delta = [\Delta_1, \dots, \Delta_m], \quad \Delta_i > 0, \quad i = 1, 2, \dots, m$$

$$\mathrm{sat}_\Delta(u) = [\mathrm{sat}_{\Delta_1}(u_1), \dots, \mathrm{sat}_{\Delta_m}(u_m)]^T$$

$$\mathrm{sat}_{\Delta_i}(u_i) = \mathrm{sign}(u_i)\,\min(|u_i|, \Delta_i) \tag{2}$$
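As a minimal sketch of definition (2), the componentwise saturation can be written in a few lines of Python. The function name `sat` and the list-based vector representation are illustrative choices, not part of the original text.

```python
def sat(u, delta):
    """Componentwise saturation: sat_i(u_i) = sign(u_i) * min(|u_i|, delta_i).

    u and delta are plain Python lists of equal length (illustrative choice);
    every saturation level delta_i must be positive, as required in (2).
    """
    if len(u) != len(delta):
        raise ValueError("u and delta must have the same length")
    out = []
    for ui, di in zip(u, delta):
        if di <= 0:
            raise ValueError("saturation levels must be positive")
        sign = (ui > 0) - (ui < 0)          # -1, 0 or +1
        out.append(sign * min(abs(ui), di))
    return out


# Second and third components exceed their limits and are clipped.
print(sat([0.5, -3.0, 2.0], [1.0, 1.0, 1.5]))
```

Components inside their limits pass through unchanged, so the closed loop is linear as long as no component saturates.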

Here $u_i$ is the $i$-th element of the vector $u$. As in (Wredenhagen & Belanger, 1994), we take a
sequence $\{\rho_i\}_{i=1}^{N}$ such that $\rho_1 > \rho_2 > \dots > \rho_N > 0$ and a matrix $Q > 0$. Then one can
define the matrix

$$R_i = \mathrm{diag}[r_i^1, r_i^2, \dots, r_i^m], \quad r_i^j > 0, \quad j = 1, 2, \dots, m \tag{3}$$

The design of the optimal LQ regulator with prescribed degree of stability is based on
minimization of the functional

$$J(x_0, \alpha) = \int_{t_0}^{\infty} e^{2\alpha t}\left[x^T(t) Q x(t) + v^T(t) R_i v(t)\right] dt \tag{4}$$

The quantity $\alpha$ in (4) defines the degree of stability of the feedback control system.
From (4), for every $R_i$ we obtain the matrices $P_i$ and $K_i$ from the equations

$$(A + \alpha I)^T P_i + P_i (A + \alpha I) - P_i B R_i^{-1} B^T P_i + Q = 0 \tag{5}$$

$$K_i = R_i^{-1} B^T P_i \tag{6}$$

where $P_i$ is the positive definite solution of the algebraic Riccati equation (5) for the
optimal LQ problem. The matrix $K_i$ is the gain of the controller.
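To make the effect of the shift $A + \alpha I$ in (5) concrete, the following sketch solves the scalar version of (5)-(6) in closed form. It assumes a first-order plant with scalars `a`, `b`, `q`, `r` (hypothetical example values, not taken from the text); for matrices, a general Riccati solver would be needed instead.

```python
import math


def scalar_lq_prescribed_stability(a, b, q, r, alpha):
    """Scalar instance of (5)-(6): 2(a + alpha)p - p^2 b^2 / r + q = 0, k = b p / r.

    Returns the positive root p and the gain k. All arguments are scalars;
    the numeric values used below are illustrative assumptions.
    """
    if b == 0 or q <= 0 or r <= 0:
        raise ValueError("need b != 0, q > 0 and r > 0")
    a_shift = a + alpha                      # plays the role of A + alpha*I
    p = r * (a_shift + math.sqrt(a_shift ** 2 + b * b * q / r)) / (b * b)
    k = b * p / r
    return p, k


a, b, q, r, alpha = 1.0, 1.0, 1.0, 0.5, 0.5
p, k = scalar_lq_prescribed_stability(a, b, q, r, alpha)
residual = 2 * (a + alpha) * p - p * p * b * b / r + q   # left-hand side of (5)
closed_loop_pole = a - b * k                              # pole under v = -k x
print(p, k, residual, closed_loop_pole)
```

In the scalar case the closed-loop pole equals $-\alpha - \sqrt{(a+\alpha)^2 + b^2 q / r}$, so it always lies to the left of $-\alpha$, which is what "prescribed degree of stability" means here.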
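The switching surfaces are ellipsoids defined by

$$E_i = E_i(P_i, \rho_i) = \{x : x^T e^{-2\alpha t} P_i x \le \rho_i\}, \quad i = 1, 2, \dots, N \tag{7}$$

The elements of the matrix $R_i$ are chosen, for a given $\rho_i$, to be the largest values for which

$$\left|\frac{1}{r_i^j} b_j^T P_i x\right| \le (1 + \beta_j)\Delta_j, \quad \forall x \in E_i(P_i, \rho_i), \quad j = 1, \dots, m \tag{8}$$

is satisfied, where $v_j$ is the $j$-th element of $v = -K_i x$ and $b_j$ is the $j$-th column of the matrix $B$.
The ellipsoids in the sequence $\{E_i\}_{i=1}^{N}$ are nested, i.e.

$$E_{i+1} \subset E_i, \quad i = 1, 2, \dots, N-1 \tag{9}$$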
The state space region contained in the largest ellipsoid can be divided into $N$ cells $\{C_i\}_{i=1}^{N}$:

$$C_i = E_i \setminus E_{i+1}, \quad C_N = E_N \tag{10}$$

Now, the controller based on the switching strategy is given in the following form

$$v = -K_i x, \quad x \in C_i, \quad i = 1, 2, \dots, N \tag{11}$$

The controller (5), (6) and (11) for $\alpha = 0$ was originally proposed in (Tarbouriech & da Silva,
2000). This controller is known as the piecewise linear LQ controller (PLC). The switching
surfaces associated with the PLC strategy are positively invariant sets (Blanchini, 1999) and
(Blanchini & Miani, 2008) given by nested ellipsoids.
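The switching rule (10)-(11) can be sketched as a lookup of the innermost ellipsoid that contains the current state. The sketch below assumes $\alpha = 0$, so the ellipsoids are the time-invariant sets $x^T P_i x \le \rho_i$; the matrices, levels and gains are hypothetical placeholders chosen only so that the sets are nested, not values designed with (5)-(8).

```python
def quad_form(x, P):
    """Return x^T P x for a list vector x and a nested-list matrix P."""
    n = len(x)
    return sum(x[r] * P[r][c] * x[c] for r in range(n) for c in range(n))


def active_cell(x, P_list, rho_list):
    """0-based index i of the cell C_i = E_i \\ E_{i+1} containing x.

    Returns None if x lies outside the largest ellipsoid E_1.
    Assumes the ellipsoids are nested, as required by (9).
    """
    index = None
    for i, (P, rho) in enumerate(zip(P_list, rho_list)):
        if quad_form(x, P) <= rho:
            index = i          # x is in E_i; a smaller ellipsoid may also contain it
        else:
            break              # nested sets: x cannot be in any later ellipsoid
    return index


def plc_control(x, K_list, P_list, rho_list):
    """Switching law (11): v = -K_i x for x in C_i."""
    i = active_cell(x, P_list, rho_list)
    if i is None:
        raise ValueError("state lies outside the largest ellipsoid")
    return [-sum(k * xj for k, xj in zip(row, x)) for row in K_list[i]]


# Hypothetical two-cell example in R^2 with one input.
P_list = [[[1.0, 0.0], [0.0, 1.0]], [[1.0, 0.0], [0.0, 1.0]]]
rho_list = [4.0, 1.0]                       # rho_1 > rho_2 > 0
K_list = [[[1.0, 0.0]], [[2.0, 0.0]]]       # higher gain in the inner cell
print(plc_control([1.5, 0.0], K_list, P_list, rho_list))
print(plc_control([0.5, 0.0], K_list, P_list, rho_list))
```

As the state moves into smaller ellipsoids, the rule hands control to the gain designed for that ellipsoid, which is how the PLC raises the gain near the origin.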

The PLC strategy proposed in (Lin, 1999) is a low-gain controller. Such a controller
underutilizes the available control capacity, and the resulting convergence of the regulation
error to zero is very slow, although saturation is avoided.

Remark 1. Recent advances in miniaturization, communication, sensing and actuation have
made it feasible to envision large numbers of autonomous vehicles working cooperatively to
accomplish an objective (Ren & Beard, 2008). Communication bandwidth and power
constraints preclude centralized command and control. As a result, a critical problem for
cooperative control is to design distributed algorithms such that the group of vehicles can
reach consensus on the shared information in the presence of limited and unreliable
information exchange and dynamically changing interaction topologies. For switching
information exchange topologies, the convergence of the consensus protocol is proved in
(Ren & Atkins, 2007). A dwell-time controller (switching controller) is used as the controller.
From the stability of the switched system it follows that consensus can be achieved
asymptotically. This is a new and interesting application of hybrid systems.

The next section considers a switching control strategy that leads to better
closed-loop performance.

3. Robust switching controller 

A common feature of the controllers described in (Hippe, 2006) is that the saturation
levels are avoided, because such controllers are low-gain controllers. It was found
in (De Dona et al., 2002) and (Lin et al., 1997) that the performance of the closed-loop system
can be improved by forcing the controls into saturation. In (De Dona et al., 2002) it is shown
that the combination of switching, scaling and over-saturation gives superior performance
compared with the case where each is used separately.
In this section, we define a class of uncertain linear systems

$$\dot{x}(t) = [A + \Delta A(\omega(t))]\,x(t) + [B + \Delta B(\omega(t))]\,u(t) \tag{12}$$

The matrices $A$ and $B$ are the nominal system and input matrices, respectively, and $\Delta A(\omega)$
and $\Delta B(\omega)$ are uncertain matrices which depend continuously on the uncertainty vector $\omega(t)$

$$\omega(t) \in \Omega \subset R^p, \quad t \in [0, \infty) \tag{13}$$
We suppose that the following assumptions are satisfied:
A.1) $(A, B)$ is controllable

A.2) $\Omega \subset R^p$ is a compact set

A.3) There are continuous mappings

$$D(\cdot) : \Omega \to R^{m \times n}, \quad E(\cdot) : \Omega \to R^{m \times m}$$

such that

$$\Delta A(\omega) = B D(\omega), \quad \Delta B(\omega) = B E(\omega), \quad \forall \omega \in \Omega$$

Assumption A.3) is known as the matching condition. This assumption can be relaxed using the
notion of mismatching threshold (Barmish & Leitmann, 1982).
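A small numerical illustration of the matching condition A.3) may help: every matched perturbation has the form $B D$ or $B E$, so it can only act through the input channel. The matrices below are hypothetical values for a second-order, single-input example and are not taken from the text.

```python
def matmul(X, Y):
    """Product of two matrices stored as nested lists."""
    return [[sum(X[i][k] * Y[k][j] for k in range(len(Y)))
             for j in range(len(Y[0]))] for i in range(len(X))]


def matched_perturbation(B, D, E):
    """Return (Delta_A, Delta_B) = (B D, B E) as in assumption A.3)."""
    return matmul(B, D), matmul(B, E)


# Hypothetical example: n = 2 states, m = 1 input.
B = [[0.0], [1.0]]        # the input acts on the second state only
D = [[0.2, -0.1]]         # m x n mapping value D(omega) for one fixed omega
E = [[0.3]]               # m x m mapping value E(omega) for the same omega
delta_A, delta_B = matched_perturbation(B, D, E)
print(delta_A)            # only the row driven by B is perturbed
print(delta_B)
```

Because the first row of `B` is zero, the first row of both perturbations is zero as well: the uncertainty enters exactly where the control enters, which is what allows the control to dominate it.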

In the control law (11), a high-gain component is incorporated by multiplying the gains by
a scaling factor $(1 + k)$ with $k \ge 0$. The PLC control law with low-and-high gain has the
form

$$u(t) = -(1 + k) K_i x(t), \quad x \in C_i, \quad i = 1, 2, \dots, N \tag{14}$$

where $k$ is a design parameter.

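We can also introduce the over-saturation index as in (De Dona et al., 2002). Let us
define the function $\beta_i(t)$ as

$$\beta_i(t) = \begin{cases} \dfrac{v_i(t) - \mathrm{sat}_{\Delta_i}(v_i(t))}{\mathrm{sat}_{\Delta_i}(v_i(t))}, & v_i(t) \ne 0 \\ 0, & v_i(t) = 0 \end{cases} \tag{15}$$

The over-saturation index is a constant $\beta_i$ such that

$$\|\beta_i(t)\|_\infty \le \beta_i \tag{16}$$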
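Definition (15) can be checked numerically with a short sketch. The function name `over_saturation` and the sample values are illustrative assumptions; the function evaluates (15) for a single control component at one time instant.

```python
def over_saturation(v_i, delta_i):
    """Evaluate beta_i in (15) for one control component.

    Returns 0 when the component is unsaturated (or zero) and the relative
    excess |v_i| / delta_i - 1 when the demand exceeds the limit delta_i.
    """
    if delta_i <= 0:
        raise ValueError("saturation level must be positive")
    if v_i == 0:
        return 0.0
    s = max(-delta_i, min(delta_i, v_i))    # sat_{delta_i}(v_i)
    return (v_i - s) / s


print(over_saturation(0.5, 1.0))    # inside the limits
print(over_saturation(1.5, 1.0))    # demand is 50 % above the limit
print(over_saturation(-3.0, 2.0))   # same relative excess, negative direction
```

Whenever the component is saturated, $|v_i| = (1 + \beta_i(t))\Delta_i$, so a bound on $\beta_i(t)$ as in (16) is a bound on how far the demanded input may exceed the available range.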



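In the case of over-saturation, the elements of the matrix $R_i$ are chosen to be the largest
values for which

$$\left|\frac{1}{r_i^j} b_j^T P_i x\right| \le (1 + \beta_j)\Delta_j, \quad \forall x \in E_i(P_i, \rho_i) \tag{17}$$

is satisfied.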



We now formulate a theorem that gives the conditions under which the control system
described by relations (12) and (14)-(17) is exponentially stable.

Theorem 1. Consider the closed-loop system described by (12) and (14)-(17), and suppose
that, together with assumptions A.1)-A.3), the following conditions are also satisfied:

A.4) $R_i$ is a positive definite matrix, $R_i > 0$, $i = 1, 2, \dots, N$

A.5) $\lambda_{\min}\{Q - 2D^T(\omega) B^T P_i - a K_i^T R_i K_i\} > 0$ for the precomputed gains $K_i$, $i = 1, 2, \dots, N$, where

$a \in (0, 1)$ for $\|D(\omega)\| > 0$, $\|E(\omega)\| > 0$

$a = 0$ for $\|D(\omega)\| = \|E(\omega)\| = 0$

A.6) $\lambda_{\min}\{E(\omega) + I\} > 1 - a$, $\forall \omega \in \Omega$

A.7) The allowed over-saturation $[\beta_{\max}]_j$ for each element $v_j$ of the control vector $v$ is
the minimum over $i = 1, 2, \dots, N$ of a bound built from $4\lambda_{\min}\{Q - 2D^T(\omega) B^T P_i - a K_i^T R_i K_i\}$,
$\lambda_{\min}\{E(\omega) + I\}$, a sum of the weights in $R_i$ over the components $p+1, \dots, m$, and $\|k_i^j\|^2$,
where $k_i^j$ is the $j$-th row of the matrix $K_i$.
[The closed-form expression for $[\beta_{\max}]_j$ is not legible in the source.]

Then the feedback (14)-(17) exponentially stabilizes system (12) with exponential rate $\alpha$ for
$\forall \omega \in \Omega$ and $x_0 \in E_1$.

Proof: In the control system (12) and (14)-(17), both the continuous state $x(t)$ and a discrete
state (relating to the switching paradigm) are present, and we choose a piecewise quadratic
Lyapunov function

$$V(x) = x^T P_i x, \quad x \in C_i, \quad i = 1, 2, \dots, N \tag{18}$$

Using relation (12) we have

$$\begin{aligned}
\dot{V}(x) &= \dot{x}^T P_i x + x^T P_i \dot{x} \\
&= [(A + \Delta A(\omega))x + (B + \Delta B(\omega))\,\mathrm{sat}_\Delta(u)]^T P_i x
 + x^T P_i [(A + \Delta A(\omega))x + (B + \Delta B(\omega))\,\mathrm{sat}_\Delta(u)] \\
&= [x^T (A + BD(\omega))^T + (\mathrm{sat}_\Delta(-(1+k)K_i x))^T (B + BE(\omega))^T] P_i x \\
&\quad + x^T P_i [(A + BD(\omega))x + (B + BE(\omega))\,\mathrm{sat}_\Delta(-(1+k)K_i x)] \\
&= x^T [(A + BD(\omega))^T P_i + P_i (A + BD(\omega))] x \\
&\quad + (\mathrm{sat}_\Delta(-(1+k)K_i x))^T (B + BE(\omega))^T P_i x
 + x^T P_i (B + BE(\omega))\,\mathrm{sat}_\Delta(-(1+k)K_i x)
\end{aligned} \tag{19}$$
From relation (5) we have

$$A^T P_i + P_i A = P_i B R_i^{-1} B^T P_i - 2\alpha P_i - Q \tag{20}$$

Using relation (20) and $B^T P_i x = -R_i v$, the first term in (19) becomes

$$\begin{aligned}
x^T[(A + BD(\omega))^T P_i + P_i(A + BD(\omega))]x
&= x^T(A^T P_i + P_i A)x + x^T D^T(\omega) B^T P_i x + x^T P_i B D(\omega) x \\
&= x^T P_i B R_i^{-1} B^T P_i x - 2\alpha x^T P_i x - x^T Q x + x^T D^T(\omega) B^T P_i x + x^T P_i B D(\omega) x \\
&= -x^T P_i B v - 2\alpha V(x) - x^T Q x + x^T D^T(\omega) B^T P_i x + x^T P_i B D(\omega) x \\
&= v^T R_i v - 2\alpha V(x) - x^T Q x - x^T D^T(\omega) R_i v - v^T R_i D(\omega) x
\end{aligned} \tag{21}$$

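Also, adding and subtracting $a v^T R_i v$, one can get

$$\begin{aligned}
x^T Q x + x^T D^T(\omega) R_i v + v^T R_i D(\omega) x - a v^T R_i v + a v^T R_i v
&= x^T Q x - 2x^T D^T(\omega) B^T P_i x - a x^T K_i^T R_i K_i x + a v^T R_i v \\
&= x^T(Q - 2D^T(\omega) B^T P_i - a K_i^T R_i K_i)x + a v^T R_i v
\end{aligned} \tag{22}$$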

For the second and third terms in relation (19) we have

$$\begin{aligned}
&(\mathrm{sat}_\Delta(-(1+k)K_i x))^T (B + BE(\omega))^T P_i x + x^T P_i (B + BE(\omega))\,\mathrm{sat}_\Delta(-(1+k)K_i x) \\
&\quad = (\mathrm{sat}_\Delta(-(1+k)K_i x))^T B^T P_i x + (\mathrm{sat}_\Delta(-(1+k)K_i x))^T E^T(\omega) B^T P_i x \\
&\qquad + x^T P_i B\,\mathrm{sat}_\Delta(-(1+k)K_i x) + x^T P_i B E(\omega)\,\mathrm{sat}_\Delta(-(1+k)K_i x) \\
&\quad = -(\mathrm{sat}_\Delta(-(1+k)K_i x))^T R_i v - (\mathrm{sat}_\Delta(-(1+k)K_i x))^T E^T(\omega) R_i v \\
&\qquad - v^T R_i\,\mathrm{sat}_\Delta(-(1+k)K_i x) - v^T R_i E(\omega)\,\mathrm{sat}_\Delta(-(1+k)K_i x) \\
&\quad = -(\mathrm{sat}_\Delta(-(1+k)K_i x))^T [E^T(\omega) + I] R_i v - v^T R_i [E(\omega) + I]\,\mathrm{sat}_\Delta(-(1+k)K_i x) \\
&\quad = -2 v^T R_i [E(\omega) + I]\,\mathrm{sat}_\Delta(-(1+k)K_i x) \\
&\quad \le -2\lambda_{\min}\{E(\omega) + I\}\, v^T R_i\,\mathrm{sat}_\Delta(-(1+k)K_i x)
\end{aligned} \tag{23}$$

From (19) and (21)-(23) and assumption A.6) of the Theorem it follows that

$$\begin{aligned}
\dot{V}(x) &\le (1 - a) v^T R_i v - 2\alpha V(x) - x^T(Q - 2D^T(\omega) B^T P_i - a K_i^T R_i K_i)x \\
&\quad - 2\lambda_{\min}\{E(\omega) + I\}\, v^T R_i\,\mathrm{sat}_\Delta(-(1+k)K_i x) \\
&\le -2\alpha V(x) + \lambda_{\min}\{E(\omega) + I\}\, v^T R_i v - x^T(Q - 2D^T(\omega) B^T P_i - a K_i^T R_i K_i)x \\
&\quad - 2\lambda_{\min}\{E(\omega) + I\}\, v^T R_i\,\mathrm{sat}_\Delta(-(1+k)K_i x) \\
&= -2\alpha V(x) - x^T(Q - 2D^T(\omega) B^T P_i - a K_i^T R_i K_i)x \\
&\quad + \lambda_{\min}\{E(\omega) + I\} \sum_{j=1}^{m} r_i^j |v_j| \left(|v_j| - 2\,\mathrm{sat}_{\Delta_j}((1+k)|v_j|)\right)
\end{aligned}$$

In the last relation, $v_j$ is the $j$-th element of $v = -K_i x$, i.e.

$$v_j = -\frac{1}{r_i^j} b_j^T P_i x = -k_i^j x$$

From relation (17) one can get

$$|v_j| \le (1 + \beta_j)\Delta_j \le (1 + [\beta_{\max}]_j)\Delta_j, \quad \forall x \in C_i \subset E_i, \quad j = 1, 2, \dots, m$$

where $[\beta_{\max}]_j$ is defined in assumption A.7) of the Theorem.
For $x \ne 0$ we have two possibilities:

$$|v_{j_s}| \le 2\Delta_{j_s}, \quad j_s \in S_1, \quad S_1 = \{j_1, j_2, \dots, j_p\}$$

and

$$2\Delta_{j_s} < |v_{j_s}| \le (1 + [\beta_{\max}]_{j_s})\Delta_{j_s}, \quad j_s \in S_2, \quad S_2 = \{j_{p+1}, \dots, j_m\}$$

whereby

$$0 \le p \le m, \quad S_1 \cup S_2 = \{1, 2, \dots, m\}, \quad S_1 \cap S_2 = \emptyset$$

Using the argument of (De Dona et al., 2002), one obtains the intermediate relations (24)-(29).
From (24) and (29), relation (30) follows.
[The displayed relations (24)-(30) are not legible in the source.]

According to relation (27), the first term in (30) is always nonpositive, i.e.

$$|v_{j_s}| - 2\,\mathrm{sat}_{\Delta_{j_s}}((1 + k)|v_{j_s}|) \le 0, \quad \forall k \ge 0$$

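It is a well-known fact that the quadratic polynomial

$$z^2 + p_1 z + p_2, \quad z \in R^1$$

is negative if $z$ belongs to the interval between its zeros

$$\left(\frac{-p_1 - \sqrt{D}}{2}, \; \frac{-p_1 + \sqrt{D}}{2}\right), \quad D = p_1^2 - 4p_2 \tag{31}$$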
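The interval in (31) is easy to verify numerically. The sketch below uses hypothetical coefficients; the function name `negative_interval` is an illustrative choice.

```python
import math


def negative_interval(p1, p2):
    """Open interval on which z^2 + p1*z + p2 < 0, or None if there is none.

    The interval is the one in (31), with D = p1^2 - 4*p2. If D <= 0 the
    polynomial has no two distinct real zeros and is never negative.
    """
    D = p1 * p1 - 4.0 * p2
    if D <= 0:
        return None
    root = math.sqrt(D)
    return ((-p1 - root) / 2.0, (-p1 + root) / 2.0)


def poly(z, p1, p2):
    return z * z + p1 * z + p2


low, high = negative_interval(-3.0, 2.0)     # zeros of z^2 - 3z + 2
print(low, high)
print(poly(0.5 * (low + high), -3.0, 2.0))   # negative between the zeros
print(poly(high + 1.0, -3.0, 2.0))           # positive outside the interval
```

In the proof this fact is applied to a quadratic in the control magnitude, which is why a bound on the allowed over-saturation appears as a condition.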



Using these facts, it is possible to conclude that the second term in relation (30) is
nonpositive if condition (32) holds.
[The displayed condition (32) is not legible in the source.]
This is true because from the definition of the ellipsoid $E_i$ it follows that

$$|v_{j_s}| \le (1 + [\beta_{\max}]_{j_s})\Delta_{j_s}, \quad j_s \in S_1 \cup S_2$$

where $[\beta_{\max}]_{j_s}$ is defined in assumption A.7) of the Theorem.
From relations (31)-(32) it follows that

$$\dot{V}(x) \le -2\alpha V(x), \quad \forall x \ne 0, \quad x \in C_i, \quad i = 1, 2, \dots, N \tag{33}$$

This means that the closed-loop system is exponentially stable. Namely,

$$\|x(t)\| \le k_i \|x(t_0)\| \exp\{-\alpha(t - t_0)\}, \quad \forall x \ne 0, \quad x \in C_i, \quad i = 1, 2, \dots, N \tag{34}$$

whereby the constant $k_i$ is given by relation (35).
[The displayed relation (35) is not legible in the source.]
The trajectories in each cell $C_i$ approach the origin with an exponential decrease of $V(x)$
along the trajectory. In accordance with the philosophy of the control strategy, the trajectories
will enter the smallest ellipsoid, corresponding to $\rho_N$. The exponential stability is assured by (33).
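The qualitative behaviour described above can be illustrated with a scalar simulation. This is only a sketch under strong simplifying assumptions: a first-order plant without uncertainty, a single cell, hypothetical numerical values, and forward-Euler integration. It does not check conditions A.4)-A.7) and is not a verification of the Theorem.

```python
import math


def sat(u, delta):
    """Scalar saturation with level delta."""
    return max(-delta, min(delta, u))


def lq_gain(a, b, q, r, alpha):
    """Scalar solution of (5)-(6): returns (p, k)."""
    a_shift = a + alpha
    p = r * (a_shift + math.sqrt(a_shift ** 2 + b * b * q / r)) / (b * b)
    return p, b * p / r


def simulate(a=0.5, b=1.0, delta=1.0, alpha=0.2, k=1.0, x0=0.8,
             dt=1e-3, t_end=5.0):
    """Euler simulation of x' = a x + b sat(-(1 + k) K x); values are hypothetical."""
    p, gain = lq_gain(a, b, q=1.0, r=1.0, alpha=alpha)
    x, history = x0, []
    for n in range(int(round(t_end / dt)) + 1):
        t = n * dt
        history.append((t, x, p * x * x))        # (time, state, V(x) = p x^2)
        u = -(1.0 + k) * gain * x                # low-and-high gain law (14)
        x += dt * (a * x + b * sat(u, delta))
    return history


history = simulate()
# Compare |x(t)| with the envelope |x0| exp(-alpha t) from (34), taking k_i = 1.
bound_ok = all(abs(x) <= 0.8 * math.exp(-0.2 * t) + 1e-12 for t, x, _ in history)
v_decreasing = all(v1 <= v0 for (_, _, v0), (_, _, v1) in zip(history, history[1:]))
print(bound_ok, v_decreasing, history[-1][1])
```

In this example the input is saturated during the first part of the transient (the demanded input exceeds the limit by roughly a factor of three at the start) and the state still decays faster than the prescribed rate.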
Remark 2. The paper (De Dona et al., 2004) considers the case in which, in the model (12), the
uncertainty matrix $\Delta B(\omega) = 0$ and the prescribed degree of stability $\alpha = 0$. Also, in that
reference the Riccati equation approach is used, whereas in this paper the Lyapunov approach is
used.
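Remark 3. When unmodeled dynamics are absent, i.e. in Theorem 1

$$a = 0, \quad \|D(\omega)\| = \|E(\omega)\| = 0$$

condition A.5) of the Theorem reduces to $\lambda_{\min}\{Q\} > 0$, and the bound in condition A.7) is
built from $4\lambda_{\min}\{Q\}$ in place of $4\lambda_{\min}\{Q - 2D^T(\omega) B^T P_i - a K_i^T R_i K_i\}$.
[The displayed reduced form of A.7) is not legible in the source.]

These assumptions are identical to the assumptions in (De Dona et al., 2002).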

4. Conclusion 

In this paper, a switching controller with low-and-high gain and allowed over-saturation
for uncertain systems is considered. The unmodeled dynamics satisfy the matching conditions.
Using a piecewise quadratic Lyapunov function, the exponential stability of the
closed-loop system is proved.

It would be interesting to develop the theory for the output feedback case and for the
discrete-time case. The application of hybrid systems to distributed coordination
problems (multiple robots, spacecraft and unmanned air vehicles) is also extremely important.
1. Introduction

Torque is one of the fundamental state variables in powertrain systems. The quality of
motion control depends strongly on the accuracy and dynamics of torque generation. Modulating
the air mass flow by opening or closing a throttle is the classical way to control the torque
in gasoline combustion engines.

In addition to the throttle and the advance angle, several other variables are available to
control the torque of modern engines, either directly by control of the mixture or indirectly by
influencing the energy efficiency. The coordination of the variables for torque control is one of
the major tasks of the electronic control unit (ECU).

In addition to the generation of driving torque, other objectives (emission thresholds, fuel
consumption) have to be taken into account. The large number of variables and the pronounced
nonlinearities and interconnections between sub-processes make it more and more difficult
to satisfy the requirements on control quality with conventional map-based control
approaches. For that reason, the investigation of new approaches for engine control is an
important research field (Bauer 2003).

In this approach, substitute variables are used for engine torque control instead of the real
physical control variables. The substitute variables are used as setpoints for subsidiary
control systems. They can be seen as torque differences relative to a torque maximum
(which depends on the fresh air mass). One advantage of this approach is that the
controlled subsystems can be described by linear models, so standard design methods can be
used to construct the torque controller. Of course, the constraints on the variables have to be
considered. Owing to the linearization, the resulting control structure can be used in
conventional controller hardware.

For the superordinate controller, a Model Predictive Controller (MPC) based on state space
models is used, because this control approach takes into account constraints as well as
setpoint and disturbance trajectories.

In chapter 2 we briefly explain the most important engine processes and the main torque
control variables. The outcome of these explanations is the use of a linear multivariable model
as a basis for the design of the superordinate torque controller. The MPC algorithm and its
implementation are specified in chapter 3. Chapter 4 contains several application examples.
In this chapter it is also shown how the torque controller can be changed into a speed or
acceleration controller simply by manipulating some weighting parameters.
2. Process and torque generation

Combustion and control variables: Figure 1 shows the four cycles of a gasoline engine in a
pressure/volume diagram. The mean engine torque results from the energy difference
represented by the two closed areas in the figure.

The large area describes the relation between pressure and volume during compression and
combustion. The charge cycle is represented by the smaller area.

For good efficiency, the upper area should be as large as possible and the lower area should
be minimal (for a given gas quantity). The best possible efficiency in theory is represented
by the constant-volume cycle (or Otto cycle) (Grohe 1990, Urlaub 1995, Basshuysen 2002).



[Figure 1: cylinder pressure p over cylinder volume V, with TDC and UDC marked]

Fig. 1. p-V-Diagram (gasoline engine)

The maximum torque of the engine basically depends on the fresh air mass that
reaches the combustion chamber during the charge cycle. A desired air/fuel mixture can
be adjusted by injecting an appropriate fuel quantity. The air/fuel ratio is called lambda (λ).
If the mass of fresh air corresponds to the mass of fuel (stoichiometric ratio), the lambda
parameter is one (λ=1). For several reasons, the combustion during the combustion cycle is
practically never complete, so some fresh air and some unreacted fuel particles remain
after the cycle. A more complete reaction of the air can be achieved by increasing the fuel mass
(λ<1). This results in a rising heat supply and hence in an increase of the potential engine
torque. The maximum torque is reached at approximately λ=0.9. The torque decreases
more and more as lambda increases. So one can realize fast torque changes in direct
injection engines by modulating the fuel.

The maximum of lambda is bounded by the combustibility of the mixture (which depends on
the operating conditions). For engine concepts with a lean fuel-air ratio (λ>1), the fuel feed is
the main variable for torque control. Because of the exhaust after-treatment, λ=1 is currently
required for most gasoline engines. In case of important demands, however, it is possible to
change lambda temporarily.
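The definition of lambda can be made concrete with a short sketch. It assumes a stoichiometric air/fuel mass ratio of about 14.7 for gasoline, which is a commonly quoted approximate value and not a number taken from this text; the function names and example masses are likewise illustrative.

```python
STOICH_AFR_GASOLINE = 14.7   # assumed approximate stoichiometric air/fuel mass ratio


def air_fuel_lambda(air_mass, fuel_mass, stoich_afr=STOICH_AFR_GASOLINE):
    """Lambda = actual air/fuel mass ratio divided by the stoichiometric ratio."""
    if air_mass < 0 or fuel_mass <= 0:
        raise ValueError("need air_mass >= 0 and fuel_mass > 0")
    return (air_mass / fuel_mass) / stoich_afr


def fuel_for_lambda(air_mass, target_lambda, stoich_afr=STOICH_AFR_GASOLINE):
    """Fuel mass that gives the requested lambda for a given fresh air mass."""
    if target_lambda <= 0:
        raise ValueError("target_lambda must be positive")
    return air_mass / (target_lambda * stoich_afr)


air = 0.5                                   # fresh air mass per cycle, arbitrary unit
fuel_stoich = fuel_for_lambda(air, 1.0)     # lambda = 1
fuel_rich = fuel_for_lambda(air, 0.9)       # lambda = 0.9, near the torque maximum
print(fuel_stoich, fuel_rich, air_fuel_lambda(air, fuel_rich))
```

For a fixed fresh air mass, moving from λ=1 to λ=0.9 means injecting about 11 % more fuel, which is the sense in which the torque can be modulated through the fuel quantity.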
In addition to air and fuel, the torque can be controlled by a number of other variables, such
as the advance angle or the residual exhaust gas, which can be adjusted by the exhaust gas
recirculation system.

For turbocharged engines, the air flow depends not only on the throttle position but also on
the turbocharger pressure, which can be controlled by compressors or by exhaust gas turbines.
Pressure charging increases the maximum air mass in the combustion chamber and hence the
maximum torque, even if the displaced volume of the engine is small (downsizing concepts).

Because of the mass inertia of the air system, it takes time until a desired boost pressure is
achieved. The energy for the turbocharging process is taken from the exhaust gas stream. This
induces a torque load, which can result in non-minimum-phase system behaviour.

In addition to the variables considered so far, further variables that influence the torque are
conceivable (maximum valve opening, charge-motion valve, variable compression ratio, etc.).

Furthermore, the total torque of the engine can be influenced by deliberate load control. For
example, one can generate a negative driving torque by means of the load of the electric
generator. Conversely, one can generate a fast positive torque difference by abrupt
unloading. This torque change can be much faster than via throttle control.

In hybrid vehicles, a number of extra control variables and usable parameters are available
for torque control.

More details on the construction and functionality of internal combustion engines are given
by Schafer and Basshuysen 2002, Guzella and Onder 2004 and Pulkrabeck 2004.

From a control engineering point of view, the torque generation process of a gasoline engine is
multivariable and nonlinear. It has a large number of plant inputs, one main variable to be
controlled (the torque) and some other control objectives, such as the quality of combustion
with respect to emissions and efficiency.

Considerable differences in the dynamic behaviour of the subsystems cause further
problems, particularly for the implementation of the control algorithms. For example, the
torque can be changed very fast by modulating the advance angle (compared to the throttle).
Normally, the value that gives the best efficiency is used for the advance angle. However, to
obtain the fastest torque reduction, it can be helpful to degrade the efficiency deliberately, so
that the engine torque is adjusted via the advance angle. Fast torque reduction is required, e.g.,
during a gear shift. In idle speed control mode, fast torque interventions can be useful to
enhance the stability and dynamics of the closed loop. Fast load changes also make fast torque
changes necessary. In many cases, the dynamics of the sub-processes depend on engine speed
and load.

Whether the torque controller uses fast- or slow-acting variables depends on the dynamic,
efficiency and emission requirements. In most cases, the range of the manipulated variables is
bounded.

In the normally used "best efficiency" mode, the torque can only be reduced by the
advance angle. For special cases (e.g. idle speed mode), the controller chooses a deliberate
permanent offset from the optimal advance angle, which creates a "boost reserve" for fast
torque increases. For efficiency reasons, this torque difference is kept as small as possible,
so there is a tight positive (and changeable) bound on the control variable. This should
be considered in the control approach. The main target variables and control signals are
shown in figure 2.
To summarize: the torque control of a combustion engine is a complex multivariable
problem. In addition to the torque, other variables such as lambda, the efficiency, EGR, etc.
have to be controlled. The sub-processes are time-variant, nonlinear and coupled. The control
variables are subject to tight bounds.



[Figure 2: block diagram with the blocks Control System and Process. Labels: Torque Setpoint;
Operating Conditions (Load Torque, Engine Speed, Temperature, Ambient pressure);
Actuating Variables (Throttle, Wastegate, Advance Angle, Lambda, EGR, Valve Position);
outputs Driving Torque, Consumption, Exhaust gas quality]

Fig. 2. Actuating and control variables

Torque control: Below we describe the most important variables for torque control and the
chain of effects.
a.) fresh air mass:

As mentioned above, the engine torque mainly depends on the fresh air mass, which is the basis
for the maximum fuel mass. Classically, the air mass is controlled by a throttle in the intake.
In supercharged engines, the pressure upstream of the throttle can be controlled by a
turbocharger or compressor. The result of supercharging is a higher maximum fresh air
mass in the combustion chamber, which brings several benefits. In addition to the throttle or
the charging pressure, the air mass flow can also be controlled by a variable stroke of the inlet
valve.

It is not expedient to use all variables of the air system directly for torque control. What
matters for torque generation is mainly the air mass within the cylinder, not the way in which
it is adjusted. Hence it is more sensible to use the setpoint for the fresh air mass as the plant
input for the torque controller. This setpoint can be realized through the mentioned control
variables by the aforementioned sub-control system. One advantage of the approach is that the
sub-control system as a whole includes measures for the linearization of the air system.
Furthermore, the handling of bounds is much easier in this way.
b.) exhaust gas recirculation (EGR):

By variable actuation of the inlet and outlet valves, it is possible to arrange for both
valves to be open simultaneously during the charge cycle (valve lap). In this case, some exhaust
gas remains in the combustion chamber in addition to the fresh air. This is advantageous for
combustion and emissions. With valve lap, the throttle valve must be opened further for
the same mass of fresh air. This is another benefit, because the losses caused by the throttle
decrease. EGR can also be realized by an external return circuit. Here we only consider
the case of internal EGR. EGR induces a deviation from the desired setpoint of fresh
air. This deviation is usually corrected by a controller via the throttle. As
mentioned above, this correction is relatively slow. If the mechanism for the valve lap allows
fast shifting, fast torque adjustments can be realized, because the change of air mass caused by
the valve lap occurs immediately.
c.) advance angle:

An important parameter for the efficiency of the engine is the initiation of combustion. The
point in time mainly depends on the advance angle. The ignition angle that leads to the
best efficiency is called the optimized advance angle. It depends on the engine speed and the
amount of mixture. A delay in the advance angle leads to lower efficiency and to a decrease
of engine torque. The relation between advance angle and torque is nonlinear.

d.) lambda:

For a given fresh air mass, the amount of fuel determines the torque significantly. As
mentioned above, the torque can be manipulated by lambda within bounds. For direct injection
concepts, the torque changes immediately if lambda is modified. Torque control by means of
lambda is limited for engine concepts with λ=1.

e.) other variables

The quality of the mixture also affects the efficiency and hence the torque. Several
construction parameters are important for this quality. Furthermore, the quality depends on
adjustable variables such as fuel pressure, point of injection or the position of a charge-motion
valve. The setpoints for these parameters are primarily chosen so that the best mixture
efficiency is obtained.

In figure 3 the main variables and the effects on the process are outlined.
Fig. 3. Main variables to influence the energy conversion and the torque generation

From the explanation up to now we can deduce appropriate variables for torque control: 

- fresh air mass in the combustion chamber (adjustable via charge pressure, throttle, 
valve aperture and valve lap), 

- fuel, lambda (adjustable by injection valve), 

- advance angle, 

- internal exhaust gas recirculation (adjustable via valve lap) 
All variables are bounded. 

Subsidiary control systems: To simplify the torque control problem, it is expedient
to divide the process into several sub-control circuits. The following explanations apply
to gasoline engines with turbocharging, internal EGR, direct injection and homogeneous
engine operation (λ=1).

The sub-control circuits are outlined in figure 4. The most complicated problem is the design
of the subsystem for fresh air control. Because of the strong coupling between fresh air and
EGR, it makes sense to control both simultaneously in one sub-system. The control system has
to consider or compensate for a number of effects and nonlinear dependences.

The main task is to control the fresh air mass and the desired amount of exhaust gas that
reach the combustion chamber during the charge cycle. The control system should reject
disturbances and track new setpoint values well (fast, with small overshoot). Another
requirement is that it should be possible to describe the complete control circuit by a linear
model.

For the control structure, we propose a reference-model controller (figure 5). One of the aims
of this control structure is to achieve a desired dynamic behaviour for the circuit. The
mentioned requirement for simplification of the modelling is implicitly satisfied in this way.
Nonlinear effects can be compensated by appropriate inverted models.
Fig. 4. Model Following Control (MFC)

In the explained approach the fresh air mass is used as the main control variable. From this 

air mass results the maximum possible torque with approximately X = 0.9. 

When the exhaust gas portion is modified we only consider its influence on the fresh air; 
the influence on combustion quality is disregarded. Increasing the exhaust gas portion 
decreases the fresh air and therefore reduces the torque. A dedicated controller compensates 
this influence, so that the action of EGR on the engine torque is comparable to the 
characteristic of a high-pass filter. This is an advantage if the path from the EGR setpoint 
to the fresh air acts faster than the path via the throttle: in that case fast (but transient) 
changes of the torque can be realized. 

For torque manipulation via the advance angle no feedback system is necessary. The influence 
on the efficiency of the engine can be modelled by a characteristic curve, so a feedforward 
control action is adequate for this control path. The same conclusion holds for lambda. 
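
A feedforward path of this kind can be sketched as a table lookup that is inverted to obtain the actuator command. The breakpoints and efficiency values below are invented for illustration and are not engine data; the function names are hypothetical as well.

```python
# Hypothetical characteristic curve (illustrative numbers, not engine data):
# ignition retard from the optimum in degrees crank angle -> torque efficiency.
RETARD_DEG = [0.0, 5.0, 10.0, 15.0, 20.0]
EFFICIENCY = [1.00, 0.98, 0.92, 0.83, 0.70]   # strictly decreasing


def efficiency(retard_deg):
    """Piecewise-linear lookup, clamped at both ends of the table."""
    if retard_deg <= RETARD_DEG[0]:
        return EFFICIENCY[0]
    if retard_deg >= RETARD_DEG[-1]:
        return EFFICIENCY[-1]
    for i in range(len(RETARD_DEG) - 1):
        x0, x1 = RETARD_DEG[i], RETARD_DEG[i + 1]
        if x0 <= retard_deg <= x1:
            frac = (retard_deg - x0) / (x1 - x0)
            return EFFICIENCY[i] + frac * (EFFICIENCY[i + 1] - EFFICIENCY[i])


def retard_for_delta_torque(max_torque_nm, delta_torque_nm):
    """Feedforward: retard angle that realizes a requested delta torque (<= 0)."""
    target = (max_torque_nm + delta_torque_nm) / max_torque_nm
    target = min(max(target, EFFICIENCY[-1]), EFFICIENCY[0])  # stay inside the table
    for i in range(len(EFFICIENCY) - 1):
        e0, e1 = EFFICIENCY[i], EFFICIENCY[i + 1]
        if e0 >= target >= e1:
            frac = (e0 - target) / (e0 - e1)
            return RETARD_DEG[i] + frac * (RETARD_DEG[i + 1] - RETARD_DEG[i])


# Request 8 Nm below a maximum torque of 100 Nm.
angle = retard_for_delta_torque(100.0, -8.0)
```

No measurement is fed back here: the command follows from the curve alone, which is why the quality of the characteristic curve determines the accuracy of this path.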

Second-level control system: By means of the described sub-control systems the actuating 

variables of the torque control system can be defined. The main variables are the setpoints of 

the fresh air mass, the exhaust gas, the advance angle and the setpoint for lambda. The 

relation between the torque and the mentioned variables is nonlinear. Consequently the torque 

controller would also have to be nonlinear, and the controller design could be difficult. 

A more convenient way is to linearize the sub-control systems first by stationary functions 

at the inputs and outputs. In the described control approach the control variables are substituted 

by a number of partial torque setpoints (delta torques). The partial torques represent the 
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influence of the used control variables to the whole engine torque. The over-all behaviour of 

the system to be controlled is then largely linear. 

The base control variable is the theoretical maximum of the torque for a given fresh air mass. 

The other variables can be seen as desired torque differences to a reference point (the 

maximum of torque). That means that if the delta variables are all set to zero, the maximum of 

torque for the given fresh air mass should be generated by the engine. 

The difference variables serve two purposes: they are actuation variables for the torque and 
at the same time variables to be controlled. In this way it is possible to adjust other 
steady-state setpoints for lambda, the residual exhaust gas or the advance angle. 

The bounds for the substituted variables follow from the physical bounds, again by applying 
the linearization functions. It is assumed that the nonlinear functions are monotone in the 
considered range. 
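
The mapping of bounds through a monotone stationary function can be sketched as follows. The characteristic `delta_torque_lambda` is a made-up quadratic, chosen only so that λ = 0.9 gives the maximum torque and λ = 1 gives about -8 Nm as in the example of section 4; it is not a measured curve, and all names are hypothetical.

```python
def delta_torque_lambda(lam):
    """Hypothetical stationary characteristic: torque change in Nm relative to
    the maximum-torque point at lambda = 0.9 (monotone for lam >= 0.9)."""
    return -400.0 * (lam - 0.9) ** 2 - 40.0 * (lam - 0.9)


def is_monotone(f, lo, hi, samples=50):
    """Sampled check of the monotonicity assumption on [lo, hi]."""
    values = [f(lo + (hi - lo) * i / samples) for i in range(samples + 1)]
    diffs = [b - a for a, b in zip(values, values[1:])]
    return all(d >= 0 for d in diffs) or all(d <= 0 for d in diffs)


def substituted_bounds(f, lo, hi):
    """Map physical bounds [lo, hi] to bounds of the substituted (torque) variable."""
    if not is_monotone(f, lo, hi):
        raise ValueError("characteristic is not monotone in the considered range")
    a, b = f(lo), f(hi)
    return min(a, b), max(a, b)


# Physical lambda range 0.9 ... 1.1 expressed as a delta torque range.
lower, upper = substituted_bounds(delta_torque_lambda, 0.9, 1.1)
```

Because the function is monotone, the two end points of the physical range are sufficient; without monotonicity the extreme torque values could lie inside the interval.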

From the explanations above the total control structure follows (figure 5). The plant for the 

torque controller consists of the sub-control systems (fresh air, residual exhaust gas, 

lambda and advance angle) and the stationary linearization systems. The whole system can 

be described by a linear multivariable model. Only the bounds of the input variables have to 

be taken into account. The plant outputs are the engine torque and the difference variables. 

Other plant outputs could be estimated values for exhaust gas emission or for fuel 

consumption (can be estimated by simplified models). 

The setpoints for the torque are generated by the driver or by other ECU functions (e.g. for 
changing gears). The requirements on how a setpoint is realized differ. For example, a 
requested torque may have to be realized as fast as possible. In other situations the time 
is less important and one wishes to realize the desired torque with minimum fuel 
consumption. The control system should be able to handle all these requirements. 

The control variables are not available at all times. In general the range of the control 
variables is bounded, and sometimes a variable may not deviate from its setpoint at all. The 
torque controller should take these variable bounds into account. 

An appropriate control principle for this problem is the well-known model predictive 
controller (MPC). This approach can handle variable bounds as well as known future setpoints 
and disturbances. Future setpoints are known, for instance, when changing gears. Switching 
on the air conditioning system is an example of a known future disturbance. Different 
priorities can be matched by appropriate weighting matrices in the control algorithm. 

The model structures for the superordinate torque control system can be explained from 

figure 5. 

For example we can arrange the model equation 
if we use all of the aforementioned control variables (fresh air, residual exhaust gas, lambda 
and advance angle). The output signals of the model are the engine torque (y1) and the three 
difference control variables as they actually occur (y2 ... y4). 
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Fig. 5. Two layer control approach 

The engine torque (y1) depends on all actuating variables. G11(s) describes the effect of the 
fresh air path on the engine torque, G12(s) models the effect of the exhaust gas path on the 
engine torque, G13(s) describes the behaviour of the torque depending on lambda and G14(s) 
the effect of the ignition angle path. We may assume that the output variables y2 ... y4 
depend only on the corresponding setpoints because of the decoupling effects of the 
sub-control circuits. G22(s), G33(s) and G44(s) are simple low-order transfer functions. 
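
Written out, the structure described above (y1 coupled to all four inputs, y2 ... y4 decoupled) takes the following form. This is a sketch derived from the description only: the input names u1 ... u4 for the partial torque setpoints of the fresh air, exhaust gas, lambda and ignition angle paths are assumed here for illustration.

```latex
\begin{bmatrix} y_1 \\ y_2 \\ y_3 \\ y_4 \end{bmatrix}
=
\begin{bmatrix}
G_{11}(s) & G_{12}(s) & G_{13}(s) & G_{14}(s) \\
0         & G_{22}(s) & 0         & 0         \\
0         & 0         & G_{33}(s) & 0         \\
0         & 0         & 0         & G_{44}(s)
\end{bmatrix}
\begin{bmatrix} u_1 \\ u_2 \\ u_3 \\ u_4 \end{bmatrix}
```

Each additional plant output (emission, consumption, speed) would add one further row of transfer functions to this matrix.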
If we have (in addition to the torque setpoint) other control requirements (e.g. consumption) 
it is easy to extend the model by appropriate transfer functions for the estimation of the 
control variable effects on the considered output variables. For exhaust gas emission and 
fuel consumption this results in a model as follows: 
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(2) 



The extension of the model is possible by using any other variable of the engine as plant 
output signals. Possibilities are, for instance, the engine speed or the acceleration. 
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In this way we can consider the torque and the speed simultaneously. The model has to be 
completed to: 



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(3) 



y7 describes the engine speed. The transfer functions in the bottom row describe the effects of 
all inputs on the speed. 

Certainly, it is not possible to adjust torque and speed independently. However, the structure 
allows switching smoothly between torque and speed control by means of the 
weighting parameters of the MPC. In the same way it is feasible to extend the model by a 
row which describes the acceleration depending on the input signals. A disadvantage of the 
model extensions is the increasing complexity. 

3. MPC-algorithm and implementation 

3.1 Approach 

As mentioned the model predictive control concept seems to be an appropriate approach for 
torque control. In the following we describe the basics of this control concept. The 
explanations are focused mainly on a state space solution. 

MPC is widely adopted in industry, primarily in chemical process engineering. This 
control principle is an established method for large constrained multivariable 
control problems (Dittmar 2004, Dünow 2004, Grimble 2001, Maciejowski 2002, Salgado et 
al. 2001). The main idea of MPC is to precalculate the control action by repeatedly solving 
an optimization problem (see figure 6). The optimization aims to minimize a performance 
criterion over a time horizon, subject to constraints on selected control and plant signals. 
An appropriate and usual criterion is the quadratic cost function 

$$J(k) = \sum_{i=0}^{H_p} \big(y(k+i|k) - r(k+i)\big)^T Q(i)\, \big(y(k+i|k) - r(k+i)\big) + \sum_{i=0}^{H_u-1} \Delta u^T(k+i|k)\, R(i)\, \Delta u(k+i|k). \tag{4}$$

y(k+i|k) denotes the predicted output for the time k+i calculated at the time instant k. 

Δu denotes the difference control signal. Q and R are the weighting matrices for the output 
and the input variables of the process, respectively. H_p and H_u are the prediction and 
control horizons. Using difference control signals in the cost function leads to better 
control structuring and inhibits high-frequency signals. In contrast to the large computing 
resources available in other application fields of MP-controllers, the hardware of engine 
control systems is limited in memory and computing power. For this reason we implemented 
an MPC algorithm which is suitable for real-time application in engine control units. As a 
minimum requirement the control algorithm should be applicable to multivariable systems 
with variable constraints on the control output. A main design requirement was a reasonable 
computing effort. 

For the MP-controller we use a set of linear time-invariant discrete-time state-space process 
models of the form 

$$x(k+1) = A\,x(k) + B\,u(k), \qquad y(k) = C\,x(k) \tag{5}$$

(u: input signal, y: output signal, x: state of the system, A: system matrix, B: input matrix, 
C: output matrix). 
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Fig. 6. Predictive Control: The basic idea 

In case of control signal constraints we have to consider the conditions 

$$\begin{aligned}
u_{\min}(k) &\le u(k|k) \le u_{\max}(k) \\
u_{\min}(k+1) &\le u(k+1|k) \le u_{\max}(k+1) \\
&\;\;\vdots \\
u_{\min}(k+H_u-1) &\le u(k+H_u-1|k) \le u_{\max}(k+H_u-1)
\end{aligned} \tag{6}$$

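From the model we obtain the prediction of the system output over a horizon of H_p 
steps at time k: 

$$\mathcal{Y}(k) = \Psi\,x(k) + \Upsilon\,u(k-1) + \Theta\,\Delta U(k) \tag{8}$$

Here the vector on the left stacks the predicted outputs y(k|k), ..., y(k+H_p|k), and ΔU(k) 
stacks the control increments Δu(k|k), ..., Δu(k+H_u-1|k). 

With the setpoint signal r(k+i|k), i = 0 ... H_p, the error vector 

$$E(k) = \begin{bmatrix} r(k|k) \\ r(k+1|k) \\ \vdots \\ r(k+H_p|k) \end{bmatrix} - \Psi\,x(k) - \Upsilon\,u(k-1) \tag{9}$$

and the weight matrices 

$$Q = \begin{bmatrix} Q(0) & & & \\ & Q(1) & & \\ & & \ddots & \\ & & & Q(H_p) \end{bmatrix} \quad \text{respectively} \quad R = \begin{bmatrix} R(0) & & & \\ & R(1) & & \\ & & \ddots & \\ & & & R(H_u-1) \end{bmatrix} \tag{10 and 11}$$

the cost function (4) becomes, up to a term that does not depend on ΔU(k), 

$$J(k) = -\Delta U(k)^T G + \Delta U(k)^T H\,\Delta U(k) \tag{12}$$

with 

$$G = 2\,\Theta^T Q\,E(k) \quad \text{and} \quad H = \Theta^T Q\,\Theta + R. \tag{13 and 14}$$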
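
How the prediction matrices of equation 8 follow from the state-space model can be sketched with NumPy. The construction below assumes the input is held constant after the control horizon and that the stacked output starts with y(k|k); the function names and the small second-order example system are hypothetical and not taken from the engine model.

```python
import numpy as np


def prediction_matrices(A, B, C, Hp, Hu):
    """Return Psi, Upsilon, Theta for outputs y(k|k) ... y(k+Hp|k)."""
    n, m = B.shape
    p = C.shape[0]
    S = [np.zeros((n, m))]        # S[i] = sum_{l=0}^{i-1} A^l B, with S[0] = 0
    Apow = [np.eye(n)]            # Apow[i] = A^i
    for _ in range(Hp):
        S.append(S[-1] + Apow[-1] @ B)
        Apow.append(A @ Apow[-1])
    Psi = np.vstack([C @ Apow[i] for i in range(Hp + 1)])
    Ups = np.vstack([C @ S[i] for i in range(Hp + 1)])
    Theta = np.zeros(((Hp + 1) * p, Hu * m))
    for i in range(Hp + 1):
        for j in range(min(i, Hu)):          # increment j acts from step j+1 on
            Theta[i * p:(i + 1) * p, j * m:(j + 1) * m] = C @ S[i - j]
    return Psi, Ups, Theta


def simulate(A, B, C, x0, u_prev, dU, Hp):
    """Step the model directly, applying the increments dU[0], dU[1], ..."""
    x, u, ys = x0, u_prev, []
    for i in range(Hp + 1):
        ys.append(C @ x)
        if i < len(dU):
            u = u + dU[i]
        x = A @ x + B @ u
    return np.concatenate(ys)


# Illustrative system (made-up numbers).
A = np.array([[0.9, 0.1], [0.0, 0.8]])
B = np.array([[0.0], [1.0]])
C = np.array([[1.0, 0.0]])
x0, u_prev = np.array([1.0, 0.0]), np.array([0.5])
dU = np.array([[0.2], [-0.1]])               # Hu = 2 increments

Psi, Ups, Theta = prediction_matrices(A, B, C, Hp=5, Hu=2)
y_pred = Psi @ x0 + Ups @ u_prev + Theta @ dU.ravel()
y_sim = simulate(A, B, C, x0, u_prev, dU, Hp=5)
```

The matrices depend only on the model and the horizons, so they can be computed once offline; only x(k), u(k-1) and the setpoints change from sample to sample.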



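The constraint conditions from equation 6 can be formulated as a linear matrix inequality 

$$A\,\Delta U(k) \le U_a \tag{15}$$

with 

$$A = \begin{bmatrix} T \\ -T \end{bmatrix}, \qquad U_a = \begin{bmatrix} u_{\max}(k) - u(k-1) \\ u_{\max}(k+1) - u(k-1) \\ \vdots \\ u_{\max}(k+H_u-1) - u(k-1) \\ -u_{\min}(k) + u(k-1) \\ -u_{\min}(k+1) + u(k-1) \\ \vdots \\ -u_{\min}(k+H_u-1) + u(k-1) \end{bmatrix} \tag{16 and 17}$$

where T is the block lower-triangular matrix of identity matrices that sums the increments, 
since u(k+i|k) = u(k-1) + Δu(k|k) + ... + Δu(k+i|k). 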
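
The assembly of the inequality 15 from time-varying bounds can be sketched as follows. The helper `constraint_matrices` is hypothetical; it assumes the bounds are given as arrays with one row per step of the control horizon, and the numbers are illustrative.

```python
import numpy as np


def constraint_matrices(u_prev, u_min, u_max):
    """Build A and U_a so that A @ dU <= U_a encodes u_min <= u <= u_max.

    u_prev: shape (m,), last applied input u(k-1)
    u_min, u_max: shape (Hu, m), bounds for the steps k ... k+Hu-1
    """
    Hu, m = u_max.shape
    # T sums the increments: u(k+i|k) - u(k-1) = dU_0 + ... + dU_i
    T = np.kron(np.tril(np.ones((Hu, Hu))), np.eye(m))
    A = np.vstack([T, -T])
    u_prev_stack = np.tile(u_prev, Hu)
    U_a = np.concatenate([u_max.ravel() - u_prev_stack,
                          -u_min.ravel() + u_prev_stack])
    return A, U_a


# One input, control horizon of three steps, constant bounds 0 ... 5.
u_prev = np.array([2.0])
u_min = np.zeros((3, 1))
u_max = np.full((3, 1), 5.0)
A, U_a = constraint_matrices(u_prev, u_min, u_max)

dU_ok = np.array([3.0, 0.0, -5.0])    # inputs 5, 5, 0: inside the bounds
dU_bad = np.array([2.0, 2.0, 0.0])    # inputs 4, 6, 6: violates the upper bound
feasible_ok = bool(np.all(A @ dU_ok <= U_a + 1e-12))
feasible_bad = bool(np.all(A @ dU_bad <= U_a + 1e-12))
```

A variable that must stay at its setpoint, as for the exhaust gas path in section 4, is simply given equal upper and lower bounds.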



The main problem of the model predictive controller is to minimize the cost function 12 
subject to the constraints expressed in inequality 15. Efficient numerical procedures 
(quadratic programming, QP) are available to solve this standard convex optimization 
problem, which is summarized in figure 7. For the real-time application we developed a 
computing-time-optimized QP program which uses an active set algorithm (Dünow et al. 
2005, Lekhadia et al. 2004, Lekhadia 2004a). We implemented this optimization algorithm 
for Mathworks xPC-Target and for an Infineon TriCore microcontroller board. This 
microcontroller is currently also used in ECU systems. 



[Figure 7: the MPC problem as a standard quadratic program. Minimize J(k) = -ΔU(k)^T G + ΔU(k)^T H ΔU(k) subject to A ΔU(k) ≤ U_constraints, solved by active set methods. Essential parameters: Hp, Hu, the states, the inputs and the model of the plant.] 

Fig. 7. MPC-algorithm (summarized) 

3.2 Implementation 

The model predictive controller was implemented by means of the packages LAPACK and 
BLAS for the C language (Dünow et al. 2005). These libraries are efficient with regard to 
computing effort and widely used in research and industrial applications. The MPC 
algorithm was implemented as a C S-function for Matlab/Simulink in order to use it on an 
xPC Target system. Because of the limited capabilities of the ECU the implemented control 
algorithm should need as little computing time and memory space as possible. For the 
unconstrained case we can find a constant MPC solution in which the bounds are ignored. 
The optimal ΔU(k) for the unconstrained case is found from 



$$\frac{\partial J}{\partial \Delta U(k)} = \nabla_{\Delta U(k)} J = -G + 2H\,\Delta U(k) = 0. \tag{18}$$

From equation 18 follows the optimal set of control variables: 

$$\Delta U(k)_{opt} = \frac{1}{2} H^{-1} G. \tag{19}$$

This solution minimizes the cost function because 

$$\frac{\partial^2 J}{\partial \Delta U(k)^2} = 2H = 2\,(\Theta^T Q\,\Theta + R) \tag{20}$$

is positive definite if Q ≥ 0 and R > 0. In figure 8 the unconstrained case is summarized in 
a block diagram. The implementation of a controller based on equation 19 is outlined 
in figure 9. Here the controller consists of a linear state space model, three constant gains 
and a simple anti-windup clipping structure (Dünow et al. 2005). 
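
The constant solution of equation 19 can be sketched in a few lines. Since G = 2 Θ^T Q E(k), the optimum is ΔU = K E(k) with the constant matrix K = H^{-1} Θ^T Q; one plausible reading of the "three constant gains" is K applied to the setpoints, to Ψ x(k) and to Υ u(k-1), but that interpretation is an assumption. The matrices and bounds below are made-up numbers, and the clipping is a simple saturation of the first move.

```python
import numpy as np


def unconstrained_gain(Theta, Q, R):
    """Constant gain K with dU_opt = K @ E, equivalent to 0.5 * inv(H) @ G."""
    H = Theta.T @ Q @ Theta + R
    return np.linalg.solve(H, Theta.T @ Q)


def control_move(K, E, u_prev, u_min, u_max):
    """Apply the first increment only and clip the result to the bounds."""
    m = len(u_prev)
    dU = K @ E
    u = np.clip(u_prev + dU[:m], u_min, u_max)
    return u, dU


# Illustrative data: one input, two predicted outputs, Hu = 2.
Theta = np.array([[1.0, 0.0], [1.5, 1.0]])
Q = np.eye(2)
R = 0.1 * np.eye(2)
E = np.array([1.0, 1.0])

K = unconstrained_gain(Theta, Q, R)
u, dU = control_move(K, E, u_prev=np.array([0.0]),
                     u_min=np.array([-0.5]), u_max=np.array([0.5]))

# Quantities of equations 13, 14 and 18, used to check the optimum.
H = Theta.T @ Q @ Theta + R
G = 2.0 * Theta.T @ Q @ E
gradient = -G + 2.0 * H @ dU
```

In this example the unclipped first move would be about 0.87, so the saturation becomes active. Clipping keeps the input admissible, but the clipped move is in general no longer the optimum of the constrained problem, which is the loss of performance mentioned in the conclusion.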
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Fig. 8. Block diagram of the unconstrained case 



[Figure 9: Simulink diagram of the constant controller. Inputs: setpoints and measured outputs; output: u(k); internal signals: predicted outputs, states and u(k-1).] 

Fig. 9. Implementation of the constant controller in Matlab/Simulink (Dünow et al. 2005) 

For the constrained case we have to minimize the cost function (see equation 12) subject 
to the constraints on the controller output given in equation 15. We solved this standard 
quadratic programming problem with an active set method. Figure 10 shows the flow chart 
of the optimization algorithm, which is based on Fletcher 1997, Gill et al. 1991 and 
Maciejowski 2002. The active set method involves two phases. First, a 
feasible point is calculated. The second phase generates an iterative 
sequence of feasible points that converge to the solution. The Lagrange multiplier check at 
the bottom of figure 10 is a test of the Karush-Kuhn-Tucker conditions for a global 
optimum (Fletcher 1987). 

[Figure 10: flow chart. Start: find an initial feasible solution; initially no constraints are active.] 
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Fig. 10. Flow chart of the implemented active set algorithm (Diinow et al. 2005 and Lekhadia 
2004a) 
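
The second phase of such a method can be sketched with a textbook primal active set iteration. This is not the authors' optimized C implementation: it is a minimal NumPy illustration that assumes a feasible starting point is already known (phase one is skipped), that H is positive definite and that the active constraints are linearly independent. The function name and the test problem are hypothetical.

```python
import numpy as np


def active_set_qp(H, G, A, b, x0, max_iter=50, tol=1e-9):
    """Minimise -x'G + x'Hx subject to A x <= b, starting from a feasible x0."""
    x = np.asarray(x0, dtype=float)
    n = len(x)
    W = []                                    # working set: active constraint indices
    for _ in range(max_iter):
        grad = 2.0 * H @ x - G
        k = len(W)
        Aw = A[W] if W else np.zeros((0, n))
        # Equality-constrained subproblem: step p and multipliers lam from
        # [2H Aw'; Aw 0] [p; lam] = [-grad; 0]
        KKT = np.zeros((n + k, n + k))
        KKT[:n, :n] = 2.0 * H
        KKT[:n, n:] = Aw.T
        KKT[n:, :n] = Aw
        sol = np.linalg.solve(KKT, np.concatenate([-grad, np.zeros(k)]))
        p, lam = sol[:n], sol[n:]
        if np.linalg.norm(p) < tol:
            if k == 0 or lam.min() >= -tol:
                return x, W                   # Karush-Kuhn-Tucker conditions hold
            W.pop(int(np.argmin(lam)))        # release the most negative multiplier
            continue
        # Longest step in (0, 1] that keeps the inactive constraints satisfied.
        alpha, blocking = 1.0, None
        for i in range(A.shape[0]):
            if i in W:
                continue
            slope = A[i] @ p
            if slope > tol:
                step = (b[i] - A[i] @ x) / slope
                if step < alpha:
                    alpha, blocking = step, i
        x = x + alpha * p
        if blocking is not None:
            W.append(blocking)
    raise RuntimeError("active set iteration did not converge")


# Box-constrained test problem: the unconstrained optimum (2, 1) violates x1 <= 1.
H = np.eye(2)
G = np.array([4.0, 2.0])
A = np.array([[1.0, 0.0], [0.0, 1.0], [-1.0, 0.0], [0.0, -1.0]])
b = np.array([1.0, 3.0, 1.0, 3.0])
x_opt, active = active_set_qp(H, G, A, b, x0=np.zeros(2))
```

For the MPC problem, ΔU = 0 is a feasible starting point whenever u(k-1) lies inside the current bounds, which makes phase one trivial in that case.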



4. Practical applications and results 

In this section we investigate practical aspects of the described approach. As a test 
environment we used a complex nonlinear model of a four-cylinder direct injection engine. 
This model includes the physical elements of the engine as well as the necessary ECU 
functions for torque control. The model runs in Matlab/Simulink and is sketched 
in figure 11. 
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Fig. 11. Engine model (simplified) 

The model was extended by one part for exhaust gas emission and one part for fuel 
consumption. 

Because of the different dynamics of the control pathways the sampling time is relatively 
small. This results in a correspondingly large prediction horizon of the MPC. For the 
control horizon we obtained the best compromise between performance and computational 
effort with H_u = 5. 

Figure 12 outlines the structure of the simulation system. We used a state space method 
for the modelling and solution of the control problem. The MPC block on the left of the 
Simulink model contains the control solution explained in section 3. The process model 
stands in for a real engine. The background of the model was illustrated in 
sections 1 and 2. The model in the controller block conforms to equation 2. 
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In figures 13a to 13d an application case of the torque controller is shown. The main 
objective of the control process in this example is to follow the torque setpoint as accurately 
as possible. The upper plot of figure 13a shows the setpoint and the actual value of the torque; 
the lower plot shows the four control variables. At about 1.5 seconds there is a setpoint 
step. At 2.5 seconds a load step occurs. This load step is predictable (the air conditioner 
is switched on). In the example the setpoints for the delta torques of the ignition and 
exhaust gas paths are zero. The setpoint for the lambda torque path is about -8 Nm here, which 
corresponds to λ = 1. The torque value of 8 Nm can be seen as a "reserve": the MPC can 
immediately boost the total engine torque by this value via the fuel path. The upper 
bound for the delta torque via the fuel path is zero in the example. 



[Figure 13a: upper plot: torque setpoint and torque; lower plot: torque (fresh air), torque (exhaust gas), torque (ignition angle), torque (lambda).] 

Fig. 13a. Results - max torque priority 

The upper and lower bounds for the exhaust gas path in the example are equal (zero). That 
means the MPC cannot use this variable for torque control until the constraints 
change; the controller output for this path is always equal to the setpoint (as required). 
Because the controller outputs for the delta torques are at the same time 
variables to be controlled, these outputs are used only temporarily by the MPC (also visible 
in figure 13b). The lower bounds for the fuel and the ignition path in the example are -15 
Nm. The setpoint step for the total engine torque at 1.5 seconds was predictable, and it is 
visible that the MPC automatically prepared this action: the maximum torque (represented by 
the air path) increases while the available delta variables (fuel and ignition path) decrease, 
so the total torque does not change until 1.5 seconds. In this way the MPC builds up a torque 
reserve which can be used (temporarily) to speed up the setpoint adjustment. 
The compensation of a predictable load step (in the example at about 2.5 seconds) occurs in 
the same way. Figures 13b and 13d give a zoomed view of the load compensation. The 
controller is able to readjust the torque very fast thanks to the torque reserve. 
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Fig. 13b. Results - max torque priority 

In the example above the control request was focused mainly on minimizing the deviation of 
the engine torque from the desired setpoint. This was achieved by appropriate weighting 
factors within the model predictive control algorithm. 
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Fig. 13c. Results - min torque priority 
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By changing the weighting factors it is also possible to shift the focus more towards other 
attributes. For instance, one can reduce the exhaust gas emissions by choosing a lambda-path 
setpoint that corresponds to λ = 1 and increasing the weighting factors of the lambda path. 
The results are shown in figure 13c for the same setpoint and load steps as in the example 
above. It is visible that the torque adjustment is slower (compared to figure 13a) and the 
controller output for the fuel path is used only marginally. 
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Fig. 13d. Results - min torque priority 

As mentioned in section 2, the described control approach allows smooth switching 
between torque, speed or acceleration control. This is achieved simply by the model 
extension (described in section 2, equation 3) and appropriate weighting parameters for the 
MPC. Merely by shifting the weighting factors one can switch from speed control to torque 
control and back. 
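
Why shifting weights is enough to change the control mode can be illustrated with a static, single-input toy problem. It is a deliberate simplification of the MPC cost function, not the controller itself: one input drives two outputs whose setpoints cannot both be met, and the weights decide which one is matched. The gains and setpoints are invented numbers.

```python
def best_input(weights, gains, setpoints):
    """Minimise sum_j q_j * (g_j * u - r_j)**2 over the scalar input u."""
    num = sum(q * g * r for q, g, r in zip(weights, gains, setpoints))
    den = sum(q * g * g for q, g in zip(weights, gains))
    return num / den


# Hypothetical static gains: torque [Nm] and speed [rpm] per unit of input.
GAINS = (1.0, 20.0)
# Torque and speed setpoints that are not compatible with each other.
SETPOINTS = (100.0, 1500.0)

u_torque = best_input((1.0, 0.0), GAINS, SETPOINTS)     # weight on torque only
u_speed = best_input((0.0, 1.0), GAINS, SETPOINTS)      # weight on speed only
u_mixed = best_input((1.0, 0.0025), GAINS, SETPOINTS)   # both weighted: a compromise
```

With the torque weight alone the input realizes the torque setpoint, with the speed weight alone it realizes the speed setpoint, and intermediate weights blend the two, which is what makes a smooth transition possible.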

The plots in figure 14 demonstrate how this control mode works. In this example we used 
only the fresh air and the ignition path for torque control. Up to 3 seconds the controller 
works as a speed controller; from then on it works as a torque controller. At 
approximately 6 seconds the weighting factors were switched back to the values of the speed 
control mode. Equation 21 represents the linear model which is used internally by the MPC 
for this example. 
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Switching between speed and torque control can simply be realized by changing some 
weighting factors. The behaviour of the MPC changes automatically. This is a very 
convenient way to change for instance to the idle speed control mode and back to torque 
control. 

In figure 14 the speed control is active first. The speed follows the desired characteristic 
whereas the torque differs from the corresponding setpoint. After the weighting factors 
are changed the torque is adjusted to the setpoint and the engine speed is only 
marginally considered by the controller. 
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At approximately 6 seconds the engine speed control mode was activated again. The speed 
should decrease here from the current value smoothly along a given trajectory. Figure 14 
shows that the controller solves the differing control tasks properly. In the third plot 
(from the top) one can also see that the ignition path control variable is used only 
transiently by the controller; in steady state it returns to its setpoint. 

5. Conclusion 

In this chapter we discussed an approach for torque control of gasoline engines. 
Because of the several actuating variables and control requirements, the process to be 
controlled is multivariable. The actuating variables are usually bounded and their effects 
on the engine torque are nonlinear. Hence direct use of the actuator variables for torque 
control is generally problematic. 

The two layer approach described in this chapter allows the application of standard control 
methods. The main idea of the control structure is to compensate or alleviate the 
nonlinear behaviour by subordinate control circuits. All the physical actuating variables 
are substituted by setpoints of the subordinate systems. The torque controller can therefore 
be designed on the basis of linear models. Additionally, only the bounds of some control 
variables have to be considered. 

An appropriate standard control concept for the superordinate torque controller is the 
model predictive control principle. For the implementation we used a state space approach. 
The optimization problem is solved by an active set algorithm. To reduce the computational 
effort a solution with constant parameters was introduced. However, this solution does not 
consider constraints, and a loss of performance can occur. 

The three examples in section 4 show the capability of the control concept. The control 
structure can be extended simply by completing the model and choosing appropriate weighting 
parameters. In this way the controller should be able to handle more actuating variables or 
other requirements. 

The control quality depends on the quality of the model. Although the subordinate control 
circuits contribute to the linearization of the process behaviour, the dynamic parameters may 
depend on engine speed or load. For that case a set of linear models could be useful. 

The control approach described in this chapter demonstrates that modern control 
approaches have considerable potential to improve the performance of embedded control 
systems. In addition to better performance, the variability of the systems and their ability 
to handle different control requirements could also improve. 
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1. Introduction 

Manufacturing and control procedures for automation require many different technologies. 
The last decade has seen computer technology applied widely in industrial production, 
particularly in manufacturing processes not generally associated with high technology. The 
technology used in the automated production of moulded concrete elements for the 
architectural and building industry has changed dramatically in recent years. The 
introduction of Computer Integrated Manufacturing (CIM) between 1960 and 1980 (Trybula 
& Goodman, 1989), e.g., numerically controlled machines, followed between 1980 and 1990 
by developments in robotics, e.g., advanced robotics (Shell & Hall, 2000), had a significant 
impact on automating the production of moulded concrete elements. 

The types of system that produce concrete elements are mainly machines using hydraulic 
compression and extraction. This is the most common method used to form the complex 
product shapes required by the architectural and building industry. These machines can be 
either stationary, or they can be mobile, with automated mobile machines being the machine 
of preference these days. Other relevant ancillary equipment used in the automated 
production of concrete elements are aggregate mixers, material storage silos and conveyors. 
The concrete elements required by architects and builders often include complex geometric 
shapes; it is for this reason that the central process in their production is the hydraulic press. 
Complex geometric shapes in concrete, with controlled quality, can only be produced using 
automated compression machinery (Isayev, 1987; Reinhart, 1987). The various phases of this 
process are (1) material mixture; aggregates of variable form tightly compressed into a 
mould and (2) compression to form and extract the desired product. 

The application of computer aided systems to hydraulic press machines has increased the 
range and variety of concrete elements that these machines can produce for the architectural 
and building industry (e.g., ZENITH, KNAUER, BESSER, etc.). The consistency and quality 
control of landscape and architectural products, such as concrete blocks, curb stones, 
palisades, paving stones, etc., has improved through introducing computer aided systems. 
Computer-control systems, teleoperation and automation technology, modelling and 
simulation tools are some of the technologies and techniques used to acquire the desired 
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functionality in operation control and quality in production (Chryssolouris, 1992; Gutta & 
Sinha, 1996; Marvel & Bloemer, 2000). 

However, various problems quite often arise when attempting to model and control a system, 
in most cases due to insufficient structuring of the system in the real world. In these 
cases, artificial intelligence techniques are usually applied (Fishwick & Luker, 1991; 
Hwang et al., 1995). For this reason, the design and control of automated production systems 
requires an effective development system that enables the design specifications to be 
implemented and tested prior to the actual implementation and control of the production 
system. The generation and implementation of such systems has become essential to the 
development of automated production systems (Rao et al., 1993; Nise, 1995; Shetty & Kolk, 
1997; Srovnal & Pavliska, 2002). 

Qualitative modelling approaches have been applied for a long period of time with quite 
successful results in most of the cases (Forbus, 1984; Groumpos & Krauth, 1997; de Kleer & 
Brown, 1984; Trave-Massuyes, 1992; Garani & Adam, 2007). One of the aims of this research 
was to accomplish an adequate control structure for a synchronised co-operation of the 
plant machines. Previous work on applications of qualitative modelling techniques (Adiga 
& Gadre, 1990; Lamperti & Zanella, 2003; Mak et al., 1999; Zhang et al., 1990) shows that 
further work is still needed to develop highly intelligent 
qualitative modelling approaches. 

This chapter presents the application of design, simulation and software development 
techniques for the operation and control of a concrete elements production plant and 
particularly, an automated mobile press machine (RoboPress). Teleoperation is required in 
operating the mobile press machine for performing the actual production procedures. The 
plant consists of various machines such as a press machine for concrete elements 
production, a mixing machine for aggregates mixing, aggregates storage silos, transport 
buckets and conveyors, etc. The design, operation and control of a concrete mixer machine 
and an autonomous mobile hydraulic press machine are discussed in detail. The whole 
system automates the production of moulded concrete elements for architectural and 
building projects. The research work demonstrates how the design of a state-of-the-art 
industrial plant can be optimised by using qualitative modelling and simulation from 
artificial intelligence and other engineering software tools. Further on, it shows how an 
efficient control algorithm for operating the group of machines can be derived from a 
qualitative modelling approach. 

The rest of this chapter is structured as follows: the following section describes the 
application environment of this research; in section 3 a detailed description of the overall 
plant control system is provided; section 4 provides details of the modelling and simulation 
techniques used for the development and verification of the overall system's operation and 
control algorithm prior to its implementation; in section 5 details are given for the system 
performance evaluation and implementation procedures of the software control modules 
and algorithms. The chapter is concluded in section 6, which presents the outcomes and 
future research work. 

2. Application environment 

The concrete plant under investigation and control consists of a group of machines 
including a mobile press machine for the concrete elements production, an aggregates' 
mixing machine, aggregates' storage silos, a forklift loader, feeding conveyors, etc. Other 
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basic machine components include the electronic control board based on a Hitachi EC series 
programmable logic controller of type EC-60HRP. That unit offers up to 60 I/O points, 
direct PC connection (RS232) and monitoring. A number of PLC-compatible solid-state 
inductive proximity sensors (Telemecanique type XS7C40NC440, for industrial applications) 
are employed for presence detection. The overall control is a closed-loop control system in 
which the PLC unit controls the real-time processes under the operator's supervision. The 
driving force behind this control system is the control software, the creation of which is 
based on the construction and execution of descriptive qualitative models. 

The concrete elements production of the plant varies from 6000 up to 14000 blocks per day 
(8 hours). Two of the main machines of interest, the press and the mixer machine, are shown 
in Fig. 1, while the overall configuration of the concrete plant is shown schematically in Fig. 2. 




Fig. 1. Press and mixer machines 
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Fig. 2. Concrete plant configuration 

The press and mixer machines are constructed mainly of mechanical and electrical parts and 
devices, incorporating electrical boards, PLC units and other electronic equipment. 
Basically, the plant operates as follows: aggregates from the storage silos are supplied 
through a feeding conveyor into the mixer machine, and the wet concrete produced is 
transported by a forklift loader to the press machine for the actual production of the 
concrete elements. A simplified functional diagram of the plant's overall operation cycle is 
shown in Fig. 3. 
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Fig. 3. Plant's operation cycle 



2.1 The mobile press machine 

The press machine is one of the most important production units of the concrete plant (Fig. 
4). The machine produces a variety of concrete products such as blocks, curbs, paving 
stones, etc. It consists mainly of mechanical and electrical parts and devices, the 
electrical board and the electronic control system based on a PLC unit (Hitachi EC series) 
and other electronic equipment. The machine is mobile, mounted on a four-wheeled metal 
base. Other basic machine components include a mould table and a tamper head, each fitted 
with a pair of vibrators, the aggregates' hopper, the oil pump system, the electro-valves and 
the hydraulic pump system. The mould and tamper units rest on an anti-vibration mounting 
system to reduce the wear of the moulds. 
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Fig. 4. Schematic representation of the autonomous mobile press machine (RoboPress) 

The machine is electrically powered and hydraulically actuated, with automatic control based 
on the PLC unit. It operates on a concrete floor slab, inside or outside a building. 
Concrete elements are demoulded directly onto the floor slab during a vibrating 
(starting and main vibration) and compressing cycle. The machine is equipped with a 
bilateral track corrector, automatic track collision detection and avoidance, automatic 
concrete feeding control and automatic shutdown when the safety grates are interfered with. 
The machine's motion is provided by an electro-motor in conjunction with a mechanism of 
cogs at the back of the machine. The machine can move forward and backward in two gear 
speeds, or turn (left/right) using a fifth wheel. The speed it develops ranges from 10 km/h 
(first gear) to 20 km/h (second gear). The electro-motor (reductor) is rated at 2 kW. The 
machine operates in automatic (or semi-automatic) mode, driven by an electro-hydraulic 
control system based on electro-valves and the PLC control unit. 

2.2 The mixer machine 

The mixer is of the planetary type, with a vertical rough-mixing shaft of high-resistance 
steel fitted with mixing blades of hard-wearing cast iron. The machine is equipped with a 
turnover feeding bucket, an electronic cement weighing mechanism, an electro-reductor with 
brakes for the bucket's elevation and two spiral drums for wire rope wrapping. The machine 
mixes the mineral aggregates with water and produces the wet concrete that is fed 
(by a forklift loader) to the mould of the press machine, where it is vibrated and 
compacted. A portion of the plant with the mixer platform and the materials storage and 
feeding system is shown in Fig. 5. 




Fig. 5. The materials feeding and mixing platform 



2.3 Press and mixer machine operation 

In order to describe the dynamic configuration of the machines' processes, basic details of 
the machines' requirements and activities have to be defined before the control structures 
are specified. Using that information, a control model of the system is constructed and 
executed. The resulting performance of the model is analysed and, where necessary, its 
design structure is modified. Finally, the appropriate structure of the control algorithm 
is implemented. 

There are various machine processes and activities under control (Fig. 6): processes such as 
aggregates mixing, mixer feeding, concrete transfer and concrete elements production, and 
activities such as the aggregates bucket fill operation, the aggregates drawer transfer 
motion (backwards/forwards), the tamper head and mold table motion (up/down), the motion 
of the hydraulic arm actuators, vibrator operation, etc. 

Fig. 6. Plant's operation cycle 

Beyond the above machine processes, an important machine function under strict control is 
the press movement along a trajectory path (forward/backward route and left/right turns) and 
its correcting manoeuvres according to sensor input, in order to avoid collision with any 
obstacles. A generalised view of the control algorithms of those structural processes is 
presented in Fig. 7. The left part of the figure shows in diagrammatic form the basic 
processes running in the mixer machine, while the right part shows the processes running 
in the press machine. The communication link between the processes is established through 
a transport loader. In both diagrams, the processes run in cycles. In each machine 
operation cycle, certain initialisations (or re-calibrations) are carried out and the 
positions of materials and parts are checked (e.g. aggregates mixture state, mold table and 
tamper head position), until the final output is produced (e.g. concrete extraction and 
concrete block elements, respectively). 

Fig. 7. Generalisations of press and mixer machine operation algorithms 

3. Control system 

This section provides details of the control structures and algorithms developed and used 
for data acquisition and process monitoring for the overall plant control and operation. 
Particular details are provided for the automatic control of the mobile press machine that 
performs (in cooperation with the mixer machine) the production of moulded concrete 
elements for architectural and building projects. 

The overall monitoring and control is based on a closed-loop control system with a human 
in the control loop, in order to achieve the best possible control operation. The central 
control board (based on a PLC unit, Hitachi EM-II series) monitors the machines' operations, 
which provide feedback in the form of analog input signals through electrical data 
transmission lines installed for this purpose. The PLC is programmed to process the data 
signals acquired. All the electronics for the control of the mixer and aggregates feeding 
systems are incorporated into a pushbutton-operated control console. The mobile 
press machine has its own separate electric control board, with the PLC unit incorporated 
into the main panel and a receiver (antenna) for remote operation (start, stop and turn 
manoeuvres). 

3.1 Data acquisition and control 

The data acquisition and control system consists of sensor devices that detect the status 
of the processes, such as the aggregates' level and status, and transmit it in real time as 
analog/digital signals to the programmable control unit for processing. Solid 
state proximity sensors (of inductive type and PLC compatible) are employed for presence 
detection. The central control board (based on the PLC unit) processes the inputs and 
controls the equipment by producing analog control signals at its outputs. The end receivers 
of those control signals are the electrical valve actuators, which control (open/close) the 
fluid rate of the hydraulic valves that activate the silo openings, the mixer machine 
rotation and other processes. The driving force behind this data acquisition and control 
system is the control software, installed using a combination of programming packages. 

3.2 Press machine remote control and operation 

The control system is based on a Hitachi EC series programmable logic controller of type 
EC-60HRP, which offers up to 60 I/O points and direct PC connection (RS232) and monitoring. 
The actual programming of the machine control unit is carried out using the Hitachi PLC 
programming software (ActGraph, Actron A.B. Co.) for the EC PLC series. The stored program 
is processed cyclically, with an execution speed of 1.5 µs per basic instruction. All the 
logic functions of the overall machine control are handled by that PLC unit. 

A number of solid state inductive proximity sensors of type Telemecanique XS7C40NC440, 
designed for industrial applications and PLC compatible, are employed for presence 
detection within the electronic automated system. The usable sensing range is 0-15 mm 
(.47"), and the response time is adequate for metallic targets passing the sensors at 
durations that are not critical. The use of sensors is essential for the accurate control of 
the various machine operations and processes. In each cycle, the PLC reads the status of 
the inputs, solves the programmed logic and updates the outputs. 
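
The read, solve and update cycle described above can be illustrated with a minimal Python 
sketch. The signal names (hopper_full, cart_home, open_hopper_door) and the single rung of 
logic are hypothetical and serve only as an illustration; they are not taken from the 
actual PLC program. 

```python
def scan_cycle(read_inputs, solve_logic, write_outputs):
    """One PLC scan: snapshot the inputs, evaluate the logic, update the outputs."""
    inputs = dict(read_inputs())      # input image, frozen for this scan
    outputs = solve_logic(inputs)     # the logic only sees the snapshot
    write_outputs(outputs)            # outputs are updated together at the end
    return outputs


def example_logic(inputs):
    # Hypothetical rung: open the hopper door only when the hopper is full
    # and the transfer cart is at its home position.
    return {"open_hopper_door": inputs["hopper_full"] and inputs["cart_home"]}


applied = {}  # stands in for the physical output image
result = scan_cycle(
    read_inputs=lambda: {"hopper_full": True, "cart_home": True},
    solve_logic=example_logic,
    write_outputs=applied.update,
)
```

Working on a frozen input snapshot means that a sensor changing in the middle of a scan 
only takes effect in the next scan, which is the usual behaviour of a cyclic PLC program. 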

The overall machine remote control and operation can be summarised in four stages: cart 
filling, mould filling, compression molding (pressurisation) and mould extraction. 
Compression molding basically involves pressing the wet concrete mixture 
(aggregates) between the two halves of a mould (tamper and table) so that the material 
fills the mould form. Compression pressure varies from 150 bar to 200 bar. The functionality 
of the sensory system is shown in Fig. 8. 

Fig. 8. Simplified sensory system operation flowchart 

The study of the machine's overall operation has enabled the specification of the sensory 
information required. A schematic view of the sensory system developed is shown in Fig. 9. 
The machine's hopper is periodically supplied with aggregates (wet concrete mixture), the 
level of which is sensed with a pressure sensor. Once the hopper is filled with aggregates, 
the door opens (hopper opening sensor turns ON) for a certain time interval (materials flow 
timer T1: 12 s), so that the cart that transfers the aggregates to the mould gets filled. 
Then, the door closes (hopper opening sensor turns OFF) and the transfer-cart moves forward 
(in a reciprocating way) in order to fill the mould. Each time the transfer-cart moves 
forward, a sensor is activated and the cart starts to move backwards. This is repeated three 
times (cart pass counter C1: 3). After the second pass, however, a vibrator on the mould is 
activated (timer T2: 2 s) so that the mixture is distributed evenly in the mould. Then, a 
third pass of the cart follows and the mould is filled. Once the cart is back (cart position 
switch is ON) and the mould is filled with wet aggregates, the tamper-head begins to move 
downwards, squeezing the mixture in the mould. At the same time, two pairs of vibrators (one 
pair in the tamper and the other in the mould table) are activated (until the tamper goes 
up, about 7 s) in order for the concrete product to become denser. The overall pressurising 
procedure lasts about 10 s. After that, the tamper-head and the mould-table move up and 
the machine moves forward (~1 m) to the next production point. 
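
The production cycle described above can be summarised as an ordered list of steps. The 
following Python sketch is only an illustration of that sequence: the timer and counter 
values come from the text, while the step names and the function press_cycle are 
hypothetical and do not reproduce the actual PLC program. 

```python
T1_FLOW_S = 12       # materials flow timer T1
C1_PASSES = 3        # cart pass counter C1
T2_VIBRATE_S = 2     # mould vibration timer T2
VIBRATION_S = 7      # approximate vibration time during pressing
PRESS_S = 10         # approximate total pressurising time
ADVANCE_M = 1        # approximate advance to the next production point


def press_cycle():
    """Return the ordered steps of one production cycle as (step, detail) pairs."""
    steps = [
        ("hopper door open", f"{T1_FLOW_S} s"),
        ("hopper door closed", "transfer-cart filled"),
    ]
    for n in range(1, C1_PASSES + 1):
        steps.append((f"cart pass {n}", "forward, then back"))
        if n == 2:
            # The mould vibrator runs after the second pass only.
            steps.append(("mould vibration", f"{T2_VIBRATE_S} s"))
    steps.append(("tamper head down", f"press ~{PRESS_S} s, vibrate ~{VIBRATION_S} s"))
    steps.append(("tamper head and mould table up", "demould"))
    steps.append(("machine advance", f"~{ADVANCE_M} m"))
    return steps


for step, detail in press_cycle():
    print(f"{step}: {detail}")
```
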
Fig. 9. Sensory system 

It is evident that the sensory system plays an important role in the overall operation of the 
compression press machine. The normal operation of the machine relies on correct 
sensory information. If an error is detected (sensor signal error), the machines fall into 
a faulty state. A schematic description of the machine states, taking into consideration the 
functionality of the sensory system, is given in Fig. 10. Based on that functionality, the 
control software is created and downloaded to the PLC unit. 

The mobile press machine is also equipped with a receiver (antenna) for remote control of 
its operation. Beyond the start and stop operations, teleoperation of the machine is required 
to perform the manoeuvres necessary to proceed to the next production line. While 
navigating along a production line, the machine follows the route automatically based 
on the mounted sensors. If necessary (when the machine deviates from the specified 
linear trajectory), automatic route correction is carried out based on the signals of the 
lateral sensors the machine is equipped with, using the fifth wheel to perform the actual 
correction manoeuvres. A schematic diagram of the mobile press navigation terrain is 
shown in Fig. 11. 

Fig. 10. Normal operation and faulty states diagram 

3.3 Mixer remote control and operation 

The mixer is loaded periodically with a specific volume of aggregates and cement, which are 
mixed with water for a certain period of time (~15 min). The aggregates silos and the feeding 
conveyor (of the bucket that loads the mixer with aggregates) are controlled manually from 
the central control console. However, the cement and water volumes fed into the mixer are 
controlled automatically once a certain load is reached. At this stage, the mixing operation 
starts automatically. In case of insufficient material volume or weight, the mixing operation 
does not start and the corresponding indicators are illuminated on the control console. 
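
The start condition of the mixer can be sketched as a simple interlock. The Python 
fragment below is only an illustration: the function mixer_start_check, the material names 
and the threshold values in REQUIRED are hypothetical, since the actual load limits are not 
given here. 

```python
def mixer_start_check(aggregates_kg, cement_kg, water_l, required):
    """Return (start_mixing, indicators) for one load check."""
    measured = {
        "aggregates_kg": aggregates_kg,
        "cement_kg": cement_kg,
        "water_l": water_l,
    }
    # An indicator is lit for every material that is below its required amount.
    indicators = sorted(
        name for name, value in measured.items() if value < required[name]
    )
    # Mixing starts only when no indicator is lit.
    return (not indicators, indicators)


# Hypothetical required amounts, for illustration only.
REQUIRED = {"aggregates_kg": 500.0, "cement_kg": 100.0, "water_l": 50.0}
```
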

3.4 Overall plant remote control and diagnosis 

The overall plant monitoring and control is carried out through the central control board 
(Fig. 12), which provides an operating and monitoring system. This 
consists of a panel of push buttons, key switches and lamp indicators for immediate 
visualisation of the processing signals. The operation of the plant from the central board is 
based on indicating and alarm elements. To keep operation simple, a graphic 
plan (mimic diagram) of the plant under control is integrated into the control board 
and shows the flow of operations at any point in time. 




Fig. 12. The central board of plant control and monitoring 

In addition, an external PC is connected directly (through RS232C) to the central control 
board in order to perform regular maintenance tasks (e.g., programming the internal PLC 
unit) and to collect statistical results on the values of specific parameters (e.g., 
aggregates' flow) for further analysis and diagnosis of the plant's operation. 



4. Modelling and simulation 

This section describes the modelling and simulation techniques used for the development 
and verification of the overall plant's operation and control prior to its implementation. 
Considering the complexity of the concrete plant machines' operations, MATLAB Simulink 
simulation tools were used to describe and analyse their performance. However, 
although quantitative modelling techniques provide much of the information required to 
describe a manufacturing system (e.g., using MATLAB SimMechanics), they are often too 
complex for real-time dynamic systems. For this reason, QMTOOL, a qualitative modelling and 
simulation tool already applied successfully in robotics research (Adam & Grant, 1994), 
was used in addition, to overcome the shortcomings caused by system complexity 
and extensive numerical computations. Using that tool, we have dealt successfully with 
some of the uncertainties in the positioning of the various machine parts and the control of 
the press machine processes. Prior to the actual implementation of the control structure and 
machine operation, qualitative models describing the functionality of the processes 
involved are generated and tested for their effectiveness. The qualitative models are 
expressed at a high level of abstraction, using a relatively small amount of information, 
similar to human reasoning when studying the behaviour of a complex system. 



4.1 MATLAB Simulink model 

The design and control of such an automated plant requires an efficient development 
system that would enable the design specifications to be implemented and tested prior to 
the actual development. In order to describe the machines' operation with MATLAB 
Simulink models, several factors have to be considered. This is because there are various 
machine processes and activities under control, such as the aggregates input/ output 
operation, the linear movement of aggregates' transfer drawer, the concurrent movements 
of tamper head and mold table during moulding, vibrators operation, etc. 
Given that the press machine's overall operation can be described in terms of operational 
states, as shown in diagrammatic form in Fig. 13, a working model was created. The equations 
that describe the machine's states are given by the following relationships: 



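$$
\begin{aligned}
ST_A &= (ST_A + ST_C \cdot Mv_{off}) \cdot ST_A \cdot Mf_{off} \\
ST_B &= (ST_B + ST_A \cdot Mf_{off}) \cdot ST_B \cdot Mp_{off} \\
ST_C &= (ST_C + ST_B \cdot Mp_{off}) \cdot ST_C \cdot Mv_{off}
\end{aligned}
\qquad (1)
$$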



where: 

ST_0: 

ST_A: 

ST_B: 

ST_C: 

ST_E: 

Mf: 

Mp: 

Mv: 

Merror: 

Mferror: 

Mperror 

Mverror: 
Fig. 13. States diagram of the press machine control and operation algorithm 
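
One possible reading of the set terms in equation (1) is a three-state cycle in which 
ST_A is followed by ST_B when Mf switches off, ST_B by ST_C when Mp switches off, and 
ST_C by ST_A when Mv switches off. The Python sketch below illustrates only this reading; 
the event names and the function next_state are hypothetical, and the error state ST_E is 
not modelled. 

```python
# Hypothetical event names: each flag is True when the corresponding
# activity (Mf, Mp or Mv) has just switched off.
TRANSITIONS = {
    "ST_A": ("Mf_off", "ST_B"),
    "ST_B": ("Mp_off", "ST_C"),
    "ST_C": ("Mv_off", "ST_A"),
}


def next_state(state, events):
    """Advance the cycle by one step; stay in the same state if the
    event that terminates that state has not occurred."""
    trigger, target = TRANSITIONS[state]
    return target if events.get(trigger, False) else state


state = "ST_A"
history = [state]
for events in ({"Mf_off": True}, {"Mv_off": True}, {"Mp_off": True}, {"Mv_off": True}):
    state = next_state(state, events)
    history.append(state)
```

In this run the second event (Mv_off while in ST_B) is ignored, because each state only 
reacts to the end of its own activity. 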

A partial view of the overall press machine operation structure, built in MATLAB Simulink, 
is given in Fig. 14. 

Fig. 14. Partial view of Simulink-based control system model of press machine 

In particular, the hydraulic driving units had to be studied in order to describe the 
compression molding operation and to create models for simulation analysis. Hydraulic 
actuators are of prime importance in the overall machine operation, particularly for 
compression molding. One of the main goals in compression molding modelling is therefore to 
verify the design and operation of the pressurisation system. For this purpose, a motion 
controller was developed to regulate the pressure (up/down) in the actuator valves of the 
mould units, simulating the action of compression molding. The final position of the 
hydraulic piston is monitored. A ramp reference input was used to evaluate the tracking 
ability of the subsystems in the model. The actual pressurisation subsystem (see Fig. 15) 
makes it possible to control and monitor the operating pressure supplied to the actuators 
of the molding system (tamper head and mould table). The control pressure generated is 
proportional to the area and the mass of the hydraulic cylinder piston under control. 

Fig. 15. Pressurisation subsystem 

Another goal is to verify the design and operation of the sensory system. For this purpose, a 
sensor cell subsystem was developed that emulates the behaviour of a proximity sensor. In 
particular, an inductive proximity sensor senses the proximity of a metal object using an 
oscillator principle. It essentially comprises an oscillator whose windings constitute the 
sensing face, where an electromagnetic field is generated. When a metal object is 
positioned within this field, the currents induced in the target form an additional 
load and the oscillations cease. This causes the output driver to operate, producing an ON or 
OFF output signal. Based on this principle of operation, a basic sensor cell subsystem was 
created (Fig. 16), upon which the overall sensory system model was built. A simple function 
called turn_sensor(mode), where mode is {ON, OFF}, was created to simulate the sensor 
status. 



[Simulink block diagram: a Sensor Cell block with signals labelled Timer (sec), Enable, 
Sensor Mode {OFF=0, ON=1}, Sensor status and Out1] 

Fig. 16. Sensor cell subsystem 
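
The behaviour of such a sensor cell can be sketched in a few lines of Python. Only the 
function name turn_sensor(mode), the {OFF=0, ON=1} coding and the 15 mm sensing range come 
from the text; the class SensorCell, the update method and the simple distance threshold 
are assumptions made for illustration and do not reproduce the Simulink subsystem. 

```python
from enum import IntEnum


class SensorMode(IntEnum):
    OFF = 0
    ON = 1


class SensorCell:
    """Minimal stand-in for an inductive proximity sensor cell."""

    def __init__(self, sensing_range_mm=15.0):
        self.sensing_range_mm = sensing_range_mm
        self.mode = SensorMode.OFF

    def turn_sensor(self, mode):
        """Set and return the sensor status (ON or OFF)."""
        self.mode = SensorMode(mode)
        return self.mode

    def update(self, target_distance_mm, enable=True):
        # Assumed rule: the sensor switches ON when it is enabled and a
        # metal target lies within the sensing range.
        in_range = enable and 0.0 <= target_distance_mm <= self.sensing_range_mm
        return self.turn_sensor(SensorMode.ON if in_range else SensorMode.OFF)


cell = SensorCell()
```

For example, cell.update(10.0) switches the cell ON, while a target at 20 mm or a disabled 
cell leaves it OFF. 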



4.2 QMTOOL model 

One of the key advantages of qualitative reasoning in general is that it can work with 
partial knowledge and information, thus overcoming some of the difficulties of quantitative 
modelling. Although dynamic continuous systems predominate in manufacturing, 
most of them can be described in a discrete-event manner. 
This is because, in most cases, their status changes instantaneously at specific time 
points or time intervals at which events take place, triggered by some actions or activities. 
For example, although the operation of the industrial compression molding press machine 
is continuous in time, its individual compression actions (e.g., mould actuators pressure 
up and down) can be described as being executed at specific, separate time points. In 
other words, a discrete-event simulation model can be built and executed, although it is 
also known that a discrete model is not always used to describe a discrete system and vice 
versa. In addition, the operation of such systems usually involves an unknown number of 
input parameters that change in a random way, which is difficult to describe analytically. 

In order to test and validate the operation and control of the press and mixer machines 
prior to the actual implementation of their operation control, simulation models were 
built, based on a given set of parameters and associated relationships. 



4.2.1 QMTOOL model construction 

It is important to determine the appropriate control and operation algorithm of the 
machines (in synchronisation) prior to their installation in plant and cooperation with the 
rest of the machines' group. It is also necessary to find the tools to test and modify such a 
control structure, even at the design specification level, without an in-depth requirement for 
programming skills. 

QMTOOL is used to produce working models of the machines under control and to test them 
until the desired functionality is obtained. The main focus is on the processes running in 
the press and mixer machines. This modelling tool, based on object-oriented techniques 
(methods and objects), provides an interactive environment that eases the modelling process. 
During the modelling phase, system models are created by simply connecting input, state and 
output variables (represented as objects) and assigning to them and their connections 
qualitative values for their magnitudes and relationships. Given a physical system, the 
user selects from a type menu the types of variables needed to define the model and to 
describe its structure sufficiently. The individual attributes of each variable, 
such as name, initial value, operating range limits (min, max), etc., are assigned from a 
data menu. 

During the execution phase, the system converts qualitative attributes into numerical data in 
order for the appropriate simulations' calculations to take place. This conversion is based on 
qualitative to numerical values conversion tables, describing basic numerical factors such as 
the operating range of the main variables (machine parameters) in a machine process, etc. 
An example of such a qualitative to numerical values conversion table is shown below in 
Table 1. 



Max, Min, Sign    +++                 +-+ 
Range             Max - Min           Max 
Step              (Range/5)*Sign      (Range/5)*Sign 
Nvalue1           QVal * Step         QVal * Step 
Upper Lim         Min + Nvalue1       Nvalue1 
Lower Lim         Upper - Step        Upper - Step 
NValue            (Upper+Lower)/2     (Upper+Lower)/2 

Table 1. Qualitative to numerical values conversion table of variables relationships 

The operating range of a variable (Range) is determined and divided (qualitative 
partitioning) by the number of qualitative values this variable can take (in our case five: 
xl, l, m, s, xs). Then, taking into consideration the actual sign of the qualitative value 
the variable is assigned, the numerical value (Nvalue1) is determined by multiplying the 
resulting step (Step) by the numerical factor that corresponds to this qualitative value 
(QVal). This numerical factor ranges from 1 to 5 for xs (extra small) to xl (extra large) 
respectively, and is 0 for 0 (zero). However, in practice it was determined that the final 
numerical value (NValue) should actually lie within its respective step-range, delimited by 
the Upper and Lower limit values. 

In order to clarify the above and understand the role of the qualitative to numerical values 
conversion table, let us examine the following single-variable example. Suppose that a 
variable is assigned a large qualitative value (numerical factor 4), with a positive sign (+) 
and an operating range defined between 0 (Min) and 100 (Max). Then, according to the 
conversion table, its numerical value (Nvalue1) is determined by the following set of 
equations: 

$$
\begin{aligned}
\text{Range} &= \text{Max} - \text{Min} &&\Rightarrow \text{Range} = 100 \\
\text{Step} &= (\text{Range}/5) \cdot \text{Sign} &&\Rightarrow \text{Step} = 20 \\
\text{Nvalue1} &= \text{QVal} \cdot \text{Step} &&\Rightarrow \text{Nvalue1} = 80
\end{aligned}
\qquad (2)
$$



However, as mentioned above, this value should lie within its respective step-range 
(determined by the Upper and Lower limit values), namely at the midpoint of that range. As a 
result, in this example the actual numerical value is determined by the following 
equations: 

$$
\begin{aligned}
\text{Upper} &= \text{Min} + \text{Nvalue1} &&\Rightarrow \text{Upper} = 80 \\
\text{Lower} &= \text{Upper} - \text{Step} &&\Rightarrow \text{Lower} = 60 \\
\text{NValue} &= (\text{Upper} + \text{Lower})/2 &&\Rightarrow \text{NValue} = 70
\end{aligned}
\qquad (3)
$$

The above conversion process is carried out similarly for all the qualitative values 
described in each system's model until they have all been converted to quantitative values. 
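
The conversion steps of the worked example can be collected into a short Python function. 
This is a minimal sketch of the "+++" column of Table 1 only; the function name 
qualitative_to_numeric is hypothetical, and the factors 2 and 3 for s and m are assumed 
from the stated 1 to 5 range (only the factor 4 for a large value is given explicitly). 
The zero value is left out because its handling is not specified. 

```python
# Numerical factors (QVal) for the qualitative values; s and m are assumed.
QVAL = {"xs": 1, "s": 2, "m": 3, "l": 4, "xl": 5}


def qualitative_to_numeric(qualitative, sign, minimum, maximum):
    """Convert a qualitative value to a number, following the '+++' column of Table 1."""
    value_range = maximum - minimum
    step = (value_range / 5) * sign
    nvalue1 = QVAL[qualitative] * step
    upper = minimum + nvalue1
    lower = upper - step
    return (upper + lower) / 2       # midpoint of the step-range


print(qualitative_to_numeric("l", +1, 0, 100))  # prints 70.0
```

With the values of the example (large, positive sign, range 0 to 100) the function returns 
70, in agreement with equations (2) and (3). 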



4.2.2 QMTOOL model simulation 

A QMTOOL system model is defined as a structure of interconnected components 
(machine's activities and processes) presented as input, state and output objects and their 
relationships presented as connection objects. 

The system model of the press machine is based on the given set of parameters and 
associated relationships, examined by the control algorithm. It is built using the types of 
variables that correspond to the following machine parameters: 

• Input variables: initial values of parts' positions, sensors, etc. 

• State variables: machine's bucket feed rate, tamper head and mold table states, etc. 

• Output variables: machine move, concrete elements production rate, etc. 

Similarly, the system model of the mixer machine is based on the given set of parameters 
and associated relationships examined by the control algorithm, and is built using the types 
of variables that correspond to the following machine parameters: 

• Input variables: initial values of aggregates conditions, etc. 

• State variables: machines bucket elevation rate, mixture state, bucket door state, etc. 

• Output variables: concrete production rate, etc. 

Qualitative values are assigned to the components (parameters) and their relationships, in 
order to describe interactively how these variables are linked together. The system objects 
have embedded behavioural rules and functionality as (Prolog) predicates, which 
can nevertheless be easily modified (through menu-driven properties) according to the 
specific model construction requirements. Using a relatively small amount of qualitative 
information to define the structure and behaviour of the machines being modelled (symbolic 
computation), prototype models were created and executed (Fig. 17) in order to produce the 
dynamic machine behaviour that reflects the functional requirements for the concrete plant 
system. This simulation model is a decision-support tool. It made it possible to test and 
redefine the machines' operation control models prior to their application, in order to 
verify and finally achieve the desired cooperative functionality of the overall control 
system. 

Fig. 17. Partial view of QMTOOL-based control system models of press and mixer machines 

In order to show how the actual relations between the input, state and output variables are 
represented qualitatively, the following notation is used: 

M+(Stvar, Invar), M-(Stvar, Stvar), f(Outvar, Stvar)    (4) 



where M+, M- and f simply indicate that there is a relationship (influence), positive or 
negative (qualitative terms representing the magnitude of the functional relationship) 
between these variables. 

The value of a state variable at the next step is calculated from its current value 
(State(t)) plus the sum of the values (Influences) contributed by the preceding variables 
(predecessors), namely: 

$$\text{State}(t+1) = \text{State}(t) + \sum_i \text{Influence}(i) \qquad (5)$$



A single influence is calculated taking into consideration the value of the predecessor 
(PredVal) and the magnitude (ConMag) of this connection, expressed in qualitative terms 
(i.e. 0 (zero), s (small), m (medium), etc.), namely: 

$$\text{Influence} = f(\text{PredVal}, \text{ConMag}) \qquad (6)$$
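
The update of a state variable from its predecessors can be illustrated with the following 
Python sketch. The function f is not specified in the text, so the product form used here, 
the numeric weights in MAGNITUDE and the explicit sign argument (standing for an M+ or M- 
connection) are all assumptions made for illustration. 

```python
# Assumed numeric weights for the qualitative connection magnitudes.
MAGNITUDE = {"0": 0.0, "xs": 0.2, "s": 0.4, "m": 0.6, "l": 0.8, "xl": 1.0}


def influence(pred_val, con_mag, sign=1):
    """Influence of one predecessor: an assumed form of f(PredVal, ConMag)."""
    return sign * MAGNITUDE[con_mag] * pred_val


def next_state(state, connections):
    """New state value: current value plus the sum of all influences.

    connections is a list of (pred_val, con_mag, sign) tuples.
    """
    return state + sum(influence(p, m, s) for p, m, s in connections)


# One M+ connection of medium magnitude and one M- connection of extra small magnitude.
new_value = next_state(10.0, [(5.0, "m", +1), (10.0, "xs", -1)])
```
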



Start 

I. The values of those state variables that do not have any predecessors other than 
themselves are calculated first. 

II. Calculate the state variable values that are successors to input variables (or 
themselves). They may have other predecessors or successors that do not have any input 
connections. 

III. Calculate those states that are successors to inputs (or themselves) and have 
successors or predecessors with input connections (called 'parallel'). The feedback value is 
taken into consideration. 

IV. Calculate the state variables that are successors to other states only (or themselves) 
and are 'non-parallel'. 

V. Finally, calculate the values of those state variables that are successors to other 
states and 'parallel'. 

Finish 

Fig. 18. State variables calculation order algorithm 

In order for the state variables' values to be calculated appropriately at the correct time 
interval, the system arranges them in a specific order according to their interconnections 
within the structure of the model. To achieve this, a set of rules is defined that 
determines the order in which the state variables should be calculated. The order of 
calculations is defined by the algorithm given in Fig. 18. 

The mapping of mathematical equations (relationships) into qualitative descriptions is 
carried out using functional (M+, M-), arithmetic (add, minus, etc.) and derivative (incr, 
steady, decr, etc.) constraints. For instance, the press machine's movement (Mv) is in a 
functional relationship with the mold table and tamper head state positions (TTp) and the 
sensors input (Ps), expressed in the following way: 

PressMachineMovement: M+{Ps}, M+{TTp} 
MoldTableFeedRate: M+{Ds}    (7) 



Internally, qualitative modelling involves the interpretation and execution of such 
equations, based on qualitative methods for modelling physical systems (Pearce et al., 1989; 
Kuipers, 1986). A graphical representation of these constraints, indicating the functional 
relationships between the parameters of interest, is shown in Fig. 19. 

Fig. 19. Partial constraints of the machines system models 

The overall behaviour produced by the model simulation is derived from the behaviour of the 
individual components of the system throughout the model structure. This behaviour is 
presented graphically as Cartesian plots of sequences of time-varying qualitative states. 
The interactions occurring in the system during the simulation process can be easily 
analysed and visualised (see Fig. 20). 

Fig. 20. Partial view of graphical simulation results 

The importance of using a model to test real-time control operation algorithms off-line in 
advance, and to evaluate their performance by applying alternative solutions back at the 
design stage, is evident from the following capabilities: 

• Describe and analyse the static and dynamic behaviour of machines processes. 

• Test and verify the machines overall cooperative functionality. 

• Verify the control of each individual machine component. 

• Adjust or redefine the design of the overall plant control system early at the 
specification stage. 

Furthermore, using a qualitative approach, the actual productivity abilities of the machines 
could be estimated on a qualitative basis, closer to human understanding. 

5. Implementation 

This section describes the generation and application of the control software modules 
derived from the simulation models. 



5.1 Control software implementation 

Qualitative machine models were created off-line in order to ensure, safely, that the 
machines' control and operation within the plant could be established cooperatively and 
efficiently. Once the corresponding machine models had been created and tested extensively 
for efficient and cooperative control, the overall control system of the machines group had 
to be implemented. 

A major component of the overall plant operation system is the control software. The control 
software modules were derived from the creation and implementation of system models. 
The overall control and actual programming of the machines group is carried out using 
QMTOOL in conjunction with appropriate Hitachi PLC programming software for EC PLC 
series, for the implementation of the instructions and PLC download. 

The implementation of the control software and the final programming of the central plant 
control board, as well as the individual PLC unit of the press machine, were realised using 
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ActGraph software (Actron A.B. Co.). ActGraph allowed the PC to interface with the Hitachi 
PLC EC series. ActGraph code, in form of a ladder diagram, allows for a final check and 
then, produces a list of instructions. These PLC instructions are then downloaded for 
execution in the PLC. A sample of the PLC code and ladder diagram is given in Fig. 21. 
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Fig. 21. Sample of PLC code and partial view of control diagram 

The ladder diagram in Fig. 21 and the PLC code listing are just a sample of the overall 
control system of the plant machines. In particular, they are part of the mixer machine 
control system. Each rung of logic in the diagram (e.g. digital or analog inputs/outputs, 
contacts, switches, output coils, relays, etc.) corresponds to certain PLC instructions or 
subroutines, as in the code listing. In practice, in most cases the ladder diagram is created 
first, before the operation control modules are implemented and the instruction listing is 
produced. Based on such simple schematic descriptions, the actual code listings are then 
created and downloaded into the programmable control unit. 
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6. Conclusion 

Since most of the modern concrete elements production plants today are often faced with 
increasing market demands for further automation, as well as the growing international 
competition, computer-control systems, teleoperation and automation technology, 
modelling and simulation tools are some of the technologies and techniques used to acquire 
the desired functionality in operation control and quality in production. The last decade has 
seen computer technology applied more widely in industrial production, particularly in 
manufacturing processes not generally associated with high-technology. Often, such 
technology was not considered because it was difficult to model manufacturing processes 
using conventional mathematical modelling tools. 

Here, the operation and control of a concrete elements production plant was presented. A 
qualitative modelling approach has been shown to improve production procedures and 
manufactured product quality. The credibility of the overall control system was validated 
using a simulation tool that utilises both conventional numerical methods and more 
advanced qualitative techniques, in order to deal efficiently with the dynamic processes 
present in the concrete plant. Qualitative modelling tools and commercially available 
software were used to aid in the operation and control of the concrete elements 
production machines. Once an optimised control model was obtained via simulation with 
QMTOOL, tests were carried out with machines manufacturing concrete elements. The 
effectiveness of the overall plant control using this approach is defined by the following 
attributes: 

• The ability to describe and analyse the static and dynamic behaviours of machine 
processes. 

• The ability to evaluate the operation of important sub-systems within the machines. 

• The ability to easily redefine the design specifications to optimise machine control. 

• The ability to produce a realistic and reliable description of the cooperative plant 
machines based on qualitative models. 

• The ability to plan and test path planning scenarios (e.g. route planning) in safety. 

• The ability to reduce the cost of machine manufacturing and minimise the risk of 
machine malfunction. 
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1. Introduction 

The most important single linear integrated circuit is the operational amplifier. Operational 
amplifiers (op-amps) are available as inexpensive circuit modules, and they are capable of 
performing a wide variety of linear and nonlinear signal processing functions (Stanley, 
1994). 

In simple cases, where the only quantity of interest is the gain of the configuration, the ideal 
op-amp model for linear circuits is used. However, the frequency response and the transient 
response of operational amplifiers can be obtained using a dynamic model. 

The bond graph methodology is a way to obtain an op-amp model that includes the 
parameters that determine its performance. A bond graph is an abstract representation of a 
system in which a collection of components interact with each other through energy ports 
and are placed in the system where energy is exchanged (Karnopp & Rosenberg, 1975). 

Bond graph modelling is widely employed nowadays, and new techniques for structural 
analysis and model reduction, as well as a number of software packages using bond 
graphs, have been developed. 

In (Gawthrop & Lorcan, 1996) an ideal operational amplifier model using the bond graph 
technique is given. This model only considers the open loop voltage gain and shows 
an application of active bonds. 

In (Gawthrop & Palmer, 2003), the "virtual earth" concept is given a natural bicausal bond 
graph interpretation, leading to simplified and intuitive models of systems containing active 
analogue electronic circuits. However, this approach does not take into account the type of 
op-amp or its internal parameters. 

In this work, a bond graph model of an op-amp that yields the time and frequency responses 
is proposed. The input and output resistances, the open loop voltage gain, the slew rate and 
the supply voltages of the operational amplifier are the internal parameters of the proposed 
bond graph model. 

In the development of this work, the Bond Graph model in an Integral causality assignment 
(BGI) is used to determine the properties of the state variables of a system (Wellstead, 1979; 
Sueur & Dauphin-Tanguy, 1991). Also, the symbolic determination of the steady state of the 
variables of a system based on the Bond Graph model in a Derivative causality assignment 
(BGD) is applied (Gonzalez et al., 2005). Finally, the simulations of the systems represented 
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by bond graph models using the software 20-Sim by Controllab Products are realized 
(Controllab Products, 2007). 

Therefore, the main result of this work is a bond graph model of an op-amp that 
considers the internal parameters of a given type of linear integrated circuit and the external 
elements connected to the op-amp, for example, the feedback circuit and the load. 

The outline of the paper is as follows. Sections 2 and 3 summarize the background of bond 
graph modelling with an integral and a derivative causality assignment. In Section 4 the bond 
graph model of an operational amplifier is proposed, and the frequency responses of 
some linear integrated circuits that implement operational amplifiers are obtained using the 
proposed bond graph model. Section 5 presents a comparator circuit built on the bond graph 
model and obtains its time response. Section 6 presents the proposed bond graph model of a 
feedback op-amp; the input and output resistances, bandwidth, slew rate and supply 
voltages of a non-inverting amplifier are determined using BGI and BGD. Section 7 gives 
filters using the bond graph model of an op-amp; in that section, the filters are applied to a 
complex signal in the physical domain. In Section 8, the bond graph model of an op-amp is 
used to design a Proportional and Integral (PI) controller and to control the velocity of a DC 
motor in a closed loop system. Finally, the conclusions are given in Section 9. 



2. Bond graph model 

Consider the following scheme of a Bond Graph model with an Integral causality 
assignment (BGI) for a multiport Linear Time Invariant (LTI) system which includes the key 
vectors of Fig. 1 (Wellstead, 1979; Sueur & Dauphin-Tanguy, 1991). 



[Figure 1: block diagram whose surviving labels are the source field $(MS_e, MS_f)$ and the detector $(D)$.] 

Fig. 1. Key vectors of a bond graph. 

In Fig. 1, $(MS_e, MS_f)$, $(C, I)$ and $(R)$ denote the source, the energy storage and the 
energy dissipation fields, $(D)$ the detector and $(0, 1, TF, GY)$ the junction structure with 
transformers, $TF$, and gyrators, $GY$. 

The states $x(t) \in \Re^n$ and $x_d(t) \in \Re^m$ are composed of energy variables $p(t)$ and $q(t)$ 
associated with $I$ and $C$ elements in integral causality and derivative causality, respectively; 
$u(t) \in \Re^p$ denotes the plant input, $y(t) \in \Re^q$ the plant output, $z(t) \in \Re^n$ the co-energy 
vector, $z_d(t) \in \Re^m$ the derivative co-energy, and $D_{in}(t) \in \Re^r$ and $D_{out}(t) \in \Re^r$ are a mixture 
of $e(t)$ and $f(t)$ showing the energy exchanges between the dissipation field and the 
junction structure (Wellstead, 1979; Sueur & Dauphin-Tanguy, 1991). 
The relations of the storage and dissipation fields are, 



$$z(t) = F x(t)$$

$$z_d(t) = F_d x_d(t)$$

The relations of the junction structure are, 

The entries of $S$ take values inside the set $\{0, \pm 1, \pm m, \pm n\}$ where $m$ and $n$ are 
transformer and gyrator modules; $S_{11}$ and $S_{22}$ are square skew-symmetric matrices, and 
$S_{12}$ and $S_{21}$ are each other's negative transpose. The state equation is (Wellstead, 
1979; Sueur & Dauphin-Tanguy, 1991), 



$$\dot{x}(t) = A_p x(t) + B_p u(t) \qquad (6)$$

$$y(t) = C_p x(t) + D_p u(t) \qquad (7)$$

where the matrices $A_p$, $B_p$, $C_p$ and $D_p$ are given by (8) to (11), being 

$$M = (I_n - L S_{22})^{-1} L \qquad (13)$$



3. Bond graph in derivative causality assignment 

We can use the Bond Graph in Derivative causality assignment (BGD) to obtain $A_p^{-1}$ 
directly. Suppose that $A_p$ is invertible and that a derivative causality assignment is 
performed on the bond graph model (Gonzalez et al., 2005). From (4), the junction structure 
is given by, 



(14) 



where the entries of $J$ have the same properties as those of $S$, and the storage elements in 
(14) have derivative causality. Also, $D_{ind}$ and $D_{outd}$ are defined by 

$$D_{outd}(t) = L_d D_{ind}(t) \qquad (15)$$

and they depend on the causality assignment of the storage elements, provided that the 
junctions have a correct causality assignment. 
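From (6) to (13) and (14) we obtain, 

$$z(t) = A_p^* \dot{x}(t) + B_p^* u(t) \qquad (16)$$

$$y_d(t) = C_p^* \dot{x}(t) + D_p^* u(t) \qquad (17)$$

where 

$$A_p^* = J_{11} + J_{12} N J_{21} \qquad (18)$$

$$B_p^* = J_{13} + J_{12} N J_{23} \qquad (19)$$

$$C_p^* = J_{31} + J_{32} N J_{21} \qquad (20)$$

$$D_p^* = J_{33} + J_{32} N J_{23} \qquad (21)$$

$$N = (I_n - L_d J_{22})^{-1} L_d \qquad (22)$$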

The state and output equations of this system in integral causality are given by (6) and (7). It 
follows from (1), (6), (7), (16) and (17) that 

$$A_p^* = F A_p^{-1} \qquad (23)$$

$$B_p^* = -F A_p^{-1} B_p \qquad (24)$$

$$C_p^* = C_p A_p^{-1} \qquad (25)$$

$$D_p^* = D_p - C_p A_p^{-1} B_p \qquad (26)$$

Considering $\dot{x}(t) = 0$, the steady state of an LTI MIMO system is defined by 

$$x_{ss} = -A_p^{-1} B_p u_{ss} \qquad (27)$$

$$y_{ss} = (D_p - C_p A_p^{-1} B_p) u_{ss} \qquad (28)$$

where $x_{ss}$ and $y_{ss}$ are the steady state of the state variables and of the output, respectively. 
In the BGD approach, the steady state is determined by 

$$x_{ss} = F^{-1} B_p^* u_{ss} \qquad (29)$$

$$y_{ss} = D_p^* u_{ss} \qquad (30)$$
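
The equivalence between the steady state computed in integral causality, (27) and (28), and the one read from the BGD, (29) and (30), can be checked numerically. The following is a minimal sketch, assuming an arbitrary hypothetical LTI system (the matrices are random illustrative values, not a model from this chapter) and NumPy:

```python
import numpy as np

rng = np.random.default_rng(0)
n, p, q = 3, 2, 2  # states, inputs, outputs (arbitrary sizes)

# Hypothetical system: A_p is strictly diagonally dominant, hence invertible.
A_p = -4.0 * np.eye(n) + rng.uniform(-0.3, 0.3, (n, n))
B_p = rng.standard_normal((n, p))
C_p = rng.standard_normal((q, n))
D_p = rng.standard_normal((q, p))
F = np.diag(rng.uniform(0.5, 2.0, n))  # constitutive relation z = F x

A_inv = np.linalg.inv(A_p)
A_s = F @ A_inv                 # (23)
B_s = -F @ A_inv @ B_p          # (24)
C_s = C_p @ A_inv               # (25)
D_s = D_p - C_p @ A_inv @ B_p   # (26)

u_ss = rng.standard_normal(p)

# Steady state from the integral causality model, (27) and (28)
x_ss_bgi = -A_inv @ B_p @ u_ss
y_ss_bgi = (D_p - C_p @ A_inv @ B_p) @ u_ss

# Steady state from the derivative causality model, (29) and (30)
x_ss_bgd = np.linalg.inv(F) @ B_s @ u_ss
y_ss_bgd = D_s @ u_ss

# Consistency of (16) for an arbitrary state: z = A_s x_dot + B_s u must equal F x
x = rng.standard_normal(n)
x_dot = A_p @ x + B_p @ u_ss
z_from_bgd = A_s @ x_dot + B_s @ u_ss

print(np.allclose(x_ss_bgi, x_ss_bgd), np.allclose(y_ss_bgi, y_ss_bgd))
print(np.allclose(z_from_bgd, F @ x))
```

The practical point of the BGD is that $D_p^*$ and $B_p^*$ are read from the junction structure, so the inversion of $A_p$ performed explicitly in this sketch is avoided in the symbolic derivation.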

4. A bond graph model of an operational amplifier 

The standard operational amplifier (op-amp) symbol is shown in Fig. 2. It has two input 

terminals, the inverting (-) input and the noninverting (+) input, and one output terminal. 
The typical op-amp operates with two Direct Current (DC) supply voltages, one positive 
and the other negative (Stanley, 1994). 

[Figure 2: op-amp symbol with the inverting input terminal, the noninverting input terminal and the output labelled.] 

Fig. 2. Operational amplifier symbol. 

The complex action of the op-amp results in the amplification of the difference between the 
voltages at the noninverting input, $V_+$, and the inverting input, $V_-$, by a large gain factor, 
$K$, called the open loop gain. The output voltage is, 

$$V_{out} = K (V_+ - V_-) \qquad (31)$$

The assumptions of the ideal op-amp are (Barna & Porat, 1989): 1) The input impedance is 
infinite. 2) The output impedance is zero. 3) The open loop gain is infinite. 4) The bandwidth 
is infinite, so that a signal of any frequency from 0 to $\infty$ Hz can be amplified without 
attenuation. 5) The slew rate is infinite, so that the output voltage changes simultaneously 
with the input voltage. 

The implications of these assumptions are: no current flows into or out of either 
input terminal of the op-amp; the voltage at the output terminal does not change as the 
loading is varied; and finally, from (31), $V_+ - V_- = V_{out}/K$, so taking the limit $K \to \infty$ 
gives $V_+ = V_-$, which indicates that the voltages at the two input terminals are forced to be 
equal in the limit. 

The assumptions of the ideal op-amp are not completely true in practice, and to be fully 
competent in the analysis and design of op-amp circuits, one must be aware of the limitations. 
Therefore, we propose a more realistic model, applicable to DC and low frequencies, based 
on a Bond Graph with Integral causality assignment (BGI). Since an op-amp is a multistage 
amplifier, it can be represented by Fig. 3. 



[Figure 3: bond graph with three blocks labelled input stage, intermediate stage (with the elements $R_h$ and $C_h$) and output stage.] 

Fig. 3. Bond graph model of an operational amplifier. 

The individual stages used in an op-amp are chosen separately to develop different amplifier 
characteristics. The amplifier characteristics determined by a given stage 
depend on whether it functions as an input stage, intermediate stage or output stage. So, the 
bond graph model of the op-amp is composed of 3 stages, which are: 

• The characteristics of the differential input stage of an operational amplifier are the most 
critical factors affecting the accuracy of an op-amp in providing voltage gain. Error 
effects of the following stages are reduced in significance by the gain isolation provided by 
the first stage. This input stage considers the two input terminals of the op-amp, the 
differential input resistance, denoted as $R_i$, which is the resistance between the 
inverting and non-inverting inputs, and the open loop gain $K$. 

• The intermediate stage introduces the frequency compensation of the op-amp using a 
lag network. Also, the slew rate of the op-amp is taken into account using an $MTF$ element. 

• Following the input and intermediate voltage gain stages of an op-amp, it is desirable to 
provide impedance isolation from loads. In this way the characteristics of the gain 
stages are preserved under load, and adequate signal current is made available to the 
load. In addition, the output stage provides isolation to the preceding stage and a low 
output impedance to the load. This stage is formed by the output terminal and the output 
resistance of the op-amp, denoted as $R_o$; the limits set by the positive and negative 
supply voltages are applied using an $MTF$ element. 

The usefulness of the bond graph model of an operational amplifier can be shown by applying 
this model to the µA741 op-amp by Fairchild Semiconductor Corporation (Stanley, 1994; 
Gayakward, 2000) and to the TL084 and OP27 by Texas Instruments (Stanley, 1994; 
Gayakward, 2000), whose data sheet values are shown in Table 1. 



Op-amp                 µA741            TL084            OP27 
$R_i$ (Ω)              $2\times10^{6}$  $10^{12}$        $6\times10^{6}$ 
$R_o$ (Ω)              75               75               70 
$K$                    $2\times10^{5}$  $2\times10^{5}$  $1.8\times10^{6}$ 
$SR$ (V/µs)            0.5              13               2.8 
Supply voltages (V)    ±15              ±18              ±22 

Table 1. Data sheets of µA741, TL084 and OP27. 

Using the data sheet of an op-amp, the high cutoff frequency of the open loop voltage gain, 
$f_o$, is determined. Moreover, the compensation parameters of the bond graph model are 
defined by 

$$f_o = \frac{1}{2\pi R_h C_h} \qquad (32)$$

From (32) and the data sheets of the µA741, TL084 and OP27 op-amps, the parameters used to 
obtain the frequency response are given in Table 2. 



Op-amp       µA741                   TL084                  OP27 
$R_h$ (Ω)    $10^{3}$                $10^{5}$               $10^{3}$ 
$C_h$ (F)    $3.06652\times10^{-5}$  $5.3051\times10^{-6}$  $1.9894\times10^{-5}$ 

Table 2. Parameters of frequency response. 
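
As a quick check of (32), the cutoff frequency implied by a pair of compensation parameters can be computed directly. The sketch below uses the µA741 and OP27 entries of Table 2; the negative exponents of $C_h$ are an assumption recovered from the expected cutoff frequencies, and the function name is illustrative only:

```python
import math

def cutoff_frequency(r_h, c_h):
    """High cutoff frequency of the open loop gain, equation (32), in Hz."""
    return 1.0 / (2.0 * math.pi * r_h * c_h)

# (R_h in ohm, C_h in farad, open loop gain K); C_h exponents are assumed values
op_amps = {
    "uA741": (1e3, 3.06652e-5, 2e5),
    "OP27": (1e3, 1.9894e-5, 1.8e6),
}

results = {}
for name, (r_h, c_h, k) in op_amps.items():
    f_o = cutoff_frequency(r_h, c_h)
    k_db = 20.0 * math.log10(k)   # open loop gain in decibels
    results[name] = (f_o, k_db, k * f_o)
    print(f"{name}: f_o = {f_o:.2f} Hz, K = {k_db:.1f} dB, K*f_o = {k * f_o:.3e} Hz")
```

For the µA741 this gives a cutoff close to 5 Hz and a gain-bandwidth product close to 1 MHz, which agrees with the values quoted later for Fig. 4.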

Substituting the parameters of Tables 1 and 2 into the bond graph model of each op-amp, the 
frequency responses of the µA741, TL084 and OP27 op-amps are obtained; they are shown in 
Fig. 4, 5 and 6, respectively. 

Note that the frequency responses of the µA741, TL084 and OP27 op-amps are very 
close to those given in the data sheets of these op-amps (Stanley, 1994; Gayakward, 2000). 
Also, the decibel gain is $K_{dB} = 20\log(K)$. In the next section, an application of the bond 
graph model of an op-amp is presented in order to verify the time response. 

Fig. 4. Frequency response of µA741 op-amp. 

Fig. 5. Frequency response of TL084 op-amp. 

Fig. 6. Frequency response of OP27 op-amp. 



5. Comparator circuit 

Comparator circuits represent the first class of circuits considered here that are 
basically nonlinear in operation. Specifically, comparator circuits produce two or more 
discrete outputs, each of which is dependent on the input level (Floyd & Buchla, 1999). In 
this application, the op-amp is used in the open loop configuration with the input voltage on 
one input and a reference voltage on the other, as shown in Fig. 7. 

Fig. 7. Bond graph model of a comparator op-amp. 

A voltage $v_+(t) = 3\sin(2\pi f t)$ V, where $f = 0.15$ Hz, is applied to the noninverting input 
and $v_-(t) = 1$ V to the inverting input of the bond graph model of Fig. 7. With supply 
voltages of $+12$ V and $-12$ V, the time response of the comparator circuit using 
the µA741 op-amp is shown in Fig. 8. 
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Fig. 8. Time response of the comparator op-amp. 

In accordance with the objective of the comparator circuit, the time response of Fig. 8 is 
satisfactory (Stanley, 1994; Gayakward, 2000; Floyd & Buchla, 1999). However, if the 
frequency of the input signal increases, the phase shift versus frequency response has to be 
considered. It is obtained using the proposed bond graph model and is shown in 
Fig. 9, which gives the phase shift between the input and output signals of the µA741 op-amp. 

If the noninverting input voltage is $v_+(t) = 0.4\sin(2\pi f t)$ V where $f = 100$ Hz, the 
inverting input voltage is $v_-(t) = 0$ V and the supply voltages are $+12$ V and $-12$ V, 
then the time response of the comparator circuit using the µA741 op-amp is a 
square waveform of magnitude $\pm 12$ V with a phase shift of $\theta = -86.8°$ with respect to the 
input signal, which is shown in Fig. 10. 

Fig. 9. Phase shift versus frequency of µA741 op-amp. 

Fig. 10. Time response of comparator with $v(t) = 20\sin(200\pi t)$. 

The phase shift of the output signal of Fig. 10 can be verified from the following equation, 
which gives the lag time of the output signal, 

$$t = \frac{\theta}{360 f} \qquad (33)$$

where $\theta$ is the phase shift read from the Bode plot of the µA741 op-amp in Fig. 9, and $f$ is 
the frequency of the input signal. In this case $t = 2.41$ ms. 

In the next section, op-amp configurations with negative feedback in the physical domain 
are proposed. 
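
The lag time of (33) can be reproduced with a few lines of code. The sketch below evaluates (33) for the phase shift quoted above and, as a cross-check, estimates the phase of a single-pole open loop model; the single-pole approximation and the value $f_o \approx 5.19$ Hz (from (32) with the µA741 parameters) are assumptions of this example:

```python
import math

def lag_time(theta_deg, f_hz):
    """Lag time of the output signal from equation (33), in seconds."""
    return abs(theta_deg) / (360.0 * f_hz)

f_in = 100.0      # input frequency in Hz
theta = -86.8     # phase shift in degrees, as read from Fig. 9
t_lag = lag_time(theta, f_in)
print(f"lag time = {t_lag * 1e3:.2f} ms")

# Cross-check: phase of a first-order lag 1 / (1 + j f / f_o)
f_o = 5.19        # assumed open loop cutoff frequency in Hz
theta_model = -math.degrees(math.atan(f_in / f_o))
print(f"single-pole phase at {f_in:.0f} Hz = {theta_model:.1f} degrees")
```

The single-pole estimate lands within a fraction of a degree of the value read from the Bode plot, which is what one would expect well below any second pole.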
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6. Feedback operational amplifier 

An op-amp that uses feedback is called a feedback amplifier. A feedback 
amplifier is sometimes referred to as a closed loop amplifier because the feedback forms a 
closed loop between the input and the output. A feedback amplifier essentially consists of 
two parts: an op-amp and a feedback circuit. The feedback circuit can take any form 
whatsoever, depending on the intended application of the amplifier. This means that the 
feedback circuit may be made up of passive components, active components, or 
combinations of both (Gayakward, 2000). 

A closed loop amplifier can be represented by using two blocks, one for the op-amp and 
another for the feedback circuit. There are four ways to connect these two blocks. These 
connections are classified according to whether the voltage or current is fed back to the 
input in series or in parallel, as follows: 1) Voltage series feedback, 2) Voltage shunt 
feedback, 3) Current series feedback and 4) Current shunt feedback (Gayakward, 2000; 
Floyd & Buchla, 1999). 

The voltage series feedback configuration is one of the most important because it is 
so commonly used. An in-depth analysis of this configuration in the physical domain is 
presented in this section, computing the voltage gain, input resistance, output resistance and 
bandwidth. 

6.1 A bond graph model of a noninverting amplifier 

An op-amp connected in a closed loop configuration as a noninverting amplifier is shown in 
Fig. 11. The input signal is applied to the noninverting input. A portion of the output is 
applied back to the inverting input through the feedback network in the physical domain. 
The BGI and the Bond Graph in a Derivative causality assignment (BGD) are shown in Fig. 
11, in order to get the symbolic expressions of the closed loop system in steady state by 
applying the methodology given in Section 3. 

Fig. 11. Bond graph models of a noninverting amplifier. 

The closed loop gain, $A_{CL}$, can be determined using the BGD approach. From (30) we have, 

$$A_{CL} = \frac{y_{ss}}{u_{ss}} = D_p^* \qquad (34)$$

where $y_{ss}$ and $u_{ss}$ are the steady state values of the output and input, respectively. 
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We now derive the closed loop gain of the noninverting amplifier using the BGD. The key 
vectors of BGD are given by, 



x(t) = q 17 (t);x(t) = f„(t);z(t) = e„(t) 
u(t) = e 1 (t);y(t) = e 9 (t) 

D ind (t) = [e 4 (t) e 7 (t) e 9 (t) f u (t) f u (t) f u (t)} 
D outd (t) = [f 4 (t) f 7 (t) f 9 (t) ejt) ejt) ejt)] 
the constitutive relations are, 



F = 



1 



(35) 



(36) 



L d =diag\—,—,R F ,R,R h 



and the junction structure is, 







[Equations (37) and (38): submatrices of the junction structure matrix $J$ of the BGD of Fig. 11, whose entries depend on $K$ and on the MTF moduli $m_1$ and $m_2$.] 



From (21), (22), (34), (36), (37) and (38), the closed loop gain of the noninverting amplifier is, 

$$A_{CL} = \frac{R_L \left[ m_2 R_i K (R + R_f) + m_1 R R_o \right]}{m_1 R_o \Delta + K R R_i m_2 R_L + R_L m_2^2 m_1 \left[ R_f (R + R_i) + R R_i \right]} \qquad (39)$$

where $\Delta = (R + R_i)(R_L + R_f) + R R_i$. 

Note that the closed loop gain (39) takes into account the internal parameters of the op-amp 
and the external elements connected to it. 

In normal operation of the op-amp, the moduli of the MTF elements of the bond graph model 
are $m_1 = 1$, meaning that the slew rate of the op-amp is sufficient, and $m_2 = 1$, meaning 
that the supply voltages allow the output voltage to be determined by the input voltage and 
the gain of the op-amp. 

Considering $m_1 = 1$, $m_2 = 1$ and $R_o = 0$, we obtain 

$$A_{CL} = \frac{R + R_f}{R + \dfrac{1}{K}\left( R + R_f + \dfrac{R_f R}{R_i} \right)} \qquad (40)$$

Applying $\lim_{K \to \infty} A_{CL}$, the ideal closed loop gain of this amplifier, $(A_{CL})_I$, is determined, 

$$(A_{CL})_I = 1 + \frac{R_f}{R} \qquad (41)$$

The time response of the noninverting amplifier using the µA741 op-amp, with $R = 10$ kΩ, 
$R_f = 190$ kΩ and the input signal $v_i(t) = 0.5\sin(2\pi f_i t)$ V where $f_i = 1$ kHz, is shown 
in Fig. 12. 




Fig. 12. Time response of the noninverting amplifier. 
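
To see how close a practical op-amp comes to the ideal gain $1 + R_f/R$, the DC closed loop gain can also be obtained by plain nodal analysis. The sketch below is not the bond graph derivation of this section: it assumes an equivalent DC circuit (input resistance $R_i$ between the inputs, a controlled source $K(v_+ - v_-)$ behind $R_o$, MTF moduli equal to 1) and a hypothetical load $R_L = 10$ kΩ that does not appear in the text:

```python
import numpy as np

def closed_loop_gain(K, R_i, R_o, R, R_f, R_L):
    """DC gain v_o / v_in of a noninverting amplifier by nodal analysis.

    Assumed circuit: v_in drives the noninverting input, R_i joins the two
    inputs, R goes from the inverting node to ground, R_f from the inverting
    node to the output, and K (v_in - v_minus) drives the output through R_o.
    """
    # Unknowns: v_minus and v_o, for v_in = 1 V
    G = np.array([
        [1 / R_i + 1 / R + 1 / R_f, -1 / R_f],              # KCL at inverting node
        [K / R_o - 1 / R_f, 1 / R_o + 1 / R_f + 1 / R_L],   # KCL at output node
    ])
    b = np.array([1 / R_i, K / R_o])
    v_minus, v_o = np.linalg.solve(G, b)
    return v_o

# uA741 values from Table 1; R_L is a hypothetical load for illustration
a_cl = closed_loop_gain(K=2e5, R_i=2e6, R_o=75.0, R=10e3, R_f=190e3, R_L=10e3)
a_ideal = 1 + 190e3 / 10e3
print(f"closed loop gain = {a_cl:.4f}, ideal gain = {a_ideal:.1f}")
```

With these values the finite open loop gain lowers the closed loop gain by roughly one part in ten thousand, which is why the ideal expression is adequate at DC for this configuration.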



6.1.1 Input resistance of the noninverting amplifier 

The input resistance of the noninverting amplifier can be determined using the BGD of Fig. 
11. Considering the output $f_1(t)$, the submatrices $J_{31}$, $J_{32}$ and $J_{33}$ from (38) are changed 
to 

$$J_{32} = [\,1 \;\; 0 \;\cdots\; 0\,]; \quad J_{31} = J_{33} = 0 \qquad (42)$$

From (21), (22), (37) and (38) the input resistance is determined by, 

$$R_{iF} = \frac{(e_1)_{ss}}{(f_1)_{ss}} = \frac{R_o \left[ (R + R_i)(R_L + R_f) + R R_i \right] + K R R_L R_i + R_L \left[ R_f (R + R_i) + R R_i \right]}{(R + R_f)(R_o + R_L) + R_o R_L} \qquad (43)$$

where $(e_1)_{ss}$ and $(f_1)_{ss}$ are the steady state values of the input $e_1$ and output $f_1$, respectively. 

If $R_o = 0$, (43) reduces to 

$$R_{iF} = R_i \left( 1 + \frac{K R}{R + R_f} \right) + \frac{R R_f}{R + R_f} \qquad (44)$$

Finally, the term $\dfrac{R R_f}{R + R_f} \ll \dfrac{K R R_i}{R + R_f}$; hence the ideal input resistance of the noninverting 
amplifier, $(R_{iF})_I$, is defined by, 

$$(R_{iF})_I = R_i \left( 1 + \frac{K R}{R + R_f} \right) \qquad (45)$$

The result (45) indicates that the ideal input resistance of the op-amp with feedback is 
$\left( 1 + KR/(R + R_f) \right)$ times that without feedback. In addition, equation (43) makes it 
possible to determine the input resistance of the op-amp considering the internal parameters 
and external elements for this configuration. Equation (45) can be verified in (Stanley, 1994; 
Gayakward, 2000). 

6.1.2 Output resistance of the noninverting amplifier 

The output resistance is the resistance seen looking back into the feedback amplifier from 
the output terminal. A BGD from which the output resistance of a noninverting 
amplifier can be obtained is shown in Fig. 13. 
The key vectors of the BGD of Fig. 13 are, 

x(t) = q ls (0; m = f 16 (0; z (0 = e, 6 (0 
u(t) = e l (t);y(t) = f l (t) 

D,„At) = [e 4 (t) e 7 (t) ejt) f n (t) e u (t)\ ^ 

D otttd (t) = [f 4 (t) f 7 (t) f m {t) e n {t) f n (t)] 
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Fig. 13. Derivation of output resistance of a bond graph model of a noninverting amplifier, 
the constitutive relations are, 



111 1 



R R R. 



R, 



(47) 



F-- 

and the junction structure of the BGD is, 

[Equations (48) and (49): submatrices of the junction structure matrix $J$ of the BGD of Fig. 13.] 

Substituting (22), (47) and (49) into (21), the output resistance of the noninverting amplifier is 
obtained, 

$$R_{oF} = \frac{R_o \left[ R_i (R + R_f) + R R_f \right]}{R_i (R + R_f + K R) + R R_f + R_o (R + R_i)} \qquad (50)$$

Calculating $\lim_{R_i \to \infty} R_{oF}$ we have, 

$$R_{oF} = \frac{R_o (R + R_f)}{R + R_f + K R + R_o} \qquad (51)$$

Finally, since $R + R_f + R_o + K R \approx R + R_f + K R$, the ideal output resistance $(R_{oF})_I$ of the 
noninverting amplifier is given by, 

$$(R_{oF})_I = \frac{R_o}{1 + \dfrac{K R}{R + R_f}} \qquad (52)$$

This result shows that the ideal output resistance of the noninverting amplifier is 
$1/\left( 1 + KR/(R + R_f) \right)$ times the output resistance $R_o$ of the op-amp. That is, the output 
resistance of the op-amp with feedback is much smaller than the output resistance without 
feedback. In addition, (52) can be verified in (Stanley, 1994; Gayakward, 2000). 
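
The ideal input and output resistances share the same factor $1 + KR/(R + R_f)$, which multiplies $R_i$ and divides $R_o$. The following sketch evaluates both for the µA741 values of Table 1 with the resistor values used for Fig. 12 ($R = 10$ kΩ, $R_f = 190$ kΩ); the variable names are illustrative:

```python
# uA741 parameters from Table 1
K = 2e5        # open loop gain
R_i = 2e6      # differential input resistance in ohm
R_o = 75.0     # output resistance in ohm

# Feedback network used for Fig. 12
R = 10e3
R_f = 190e3

feedback_factor = 1.0 + K * R / (R + R_f)

r_in_ideal = R_i * feedback_factor    # ideal input resistance with feedback
r_out_ideal = R_o / feedback_factor   # ideal output resistance, equation (52)

print(f"1 + KR/(R + R_f) = {feedback_factor:.0f}")
print(f"ideal input resistance  = {r_in_ideal:.4e} ohm")
print(f"ideal output resistance = {r_out_ideal:.4e} ohm")
```

Feedback therefore raises the input resistance from megohms to tens of gigohms and lowers the output resistance from 75 Ω to a few milliohms in this example.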



6.1.3 Bandwidth of the noninverting amplifier 

The bandwidth of an amplifier is defined as the band (range) of frequencies for which the 
gain remains constant. Manufacturers generally specify either the gain-bandwidth product 
or supply the open loop gain versus frequency curve of the op-amp (Gayakward, 2000). 

Fig. 4 shows the open loop gain versus frequency curve of the µA741 op-amp. From this 
curve, for a gain of 200,000 the bandwidth is approximately 5 Hz, so the gain-bandwidth 
product is 1 MHz. At the other extreme, the bandwidth is approximately 1 MHz when the 
gain is 1. Hence, the frequency at which the gain equals 1 is known as the unity gain 
bandwidth (UGB). 

Since, for an op-amp with a single break frequency $f_o$, the gain-bandwidth product is 
constant and equal to the UGB, we can write, 

$$UGB = (K)(f_o) = (A_{CL})(f_F) \qquad (53)$$

where $f_F$ is the bandwidth with feedback. 
Therefore, the bandwidth of a feedback op-amp is, 

$$f_F = \frac{(K)(f_o)}{A_{CL}} \qquad (54)$$

Fig. 14. Frequency response of the noninverting amplifier. 

The frequency response of the noninverting amplifier based on the BGI of Fig. 11, using the 
µA741 op-amp, $R = 1$ kΩ, $R_f = 5$ kΩ and the input signal $v_i(t) = 1.0\sin(2\pi f_i t)$ V where 
$f_i = 30$ kHz, is shown in Fig. 14. 

Note that the frequency response of this amplifier indicates that the closed loop gain is 
$A_{CL} = 6 = 15.56$ dB up to approximately 166 kHz, which is verified from (54). 
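
The figures quoted for Fig. 14 follow directly from (53) and (54). A minimal sketch, assuming the rounded values $K = 2\times10^5$ and $f_o = 5$ Hz used in the text for the µA741:

```python
import math

K = 2e5       # open loop gain of the uA741
f_o = 5.0     # open loop break frequency in Hz (approximate value from the text)
R, R_f = 1e3, 5e3

ugb = K * f_o                 # unity gain bandwidth, equation (53)
a_cl = 1.0 + R_f / R          # ideal closed loop gain, equation (41)
f_F = ugb / a_cl              # bandwidth with feedback, equation (54)
a_cl_db = 20.0 * math.log10(a_cl)

print(f"UGB = {ugb / 1e6:.1f} MHz")
print(f"A_CL = {a_cl:.0f} = {a_cl_db:.2f} dB")
print(f"f_F = {f_F / 1e3:.1f} kHz")
```

Trading gain for bandwidth in this way is the usual design lever: halving the closed loop gain doubles the available bandwidth as long as the single-pole assumption holds.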

6.1.4 Slew rate 

Another important frequency related parameter of an op-amp is the slew rate. The slew rate 
is the maximum rate of change of the output voltage with respect to time, usually specified in 
V/µs. Ideally, we would like an infinite slew rate so that the op-amp's output voltage 
would change simultaneously with the input. Practical op-amps are available with slew 
rates from 0.1 V/µs to well above 1000 V/µs. The slew rate (SR) can be obtained by, 

$$SR = \frac{2\pi f V_p}{10^6} \ \text{V}/\mu\text{s} \qquad (55)$$

where $f$ is the input frequency and $V_p$ is the peak value of the output sine wave. 

In order to show the effect of the slew rate of an op-amp, the time responses of the 
noninverting amplifier based on the BGI of Fig. 11, using the µA741 and OP27 op-amps, 
$R = 1$ kΩ, $R_f = 5$ kΩ and the input signal $v_i(t) = 1.0\sin(2\pi f_i t)$ V where $f_i = 30$ kHz, 
are shown in Fig. 15. The ideal closed loop gain is then $(A_{CL})_I = 6$. 

Fig. 15. Effect of the slew rate of the noninverting amplifier using the µA741 and OP27 
op-amps. 

From (55), the minimum slew rate of an op-amp under the previous conditions is 
$SR = 1.13$ V/µs. Therefore, the slew rate of the OP27 op-amp, which is 2.8 V/µs, is 
enough for the input signal under the given conditions. Fig. 15 shows that the output signal 
of the µA741 op-amp is distorted because its slew rate is only 0.5 V/µs. 
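
The slew rate requirement of (55) is easy to tabulate against the data sheet values of Table 1. The sketch below assumes the conditions stated above (30 kHz input of 1 V peak and an ideal closed loop gain of 6, hence a 6 V output peak); the helper name is illustrative:

```python
import math

def required_slew_rate(f_hz, v_peak):
    """Minimum slew rate in V/us for an undistorted sine output, equation (55)."""
    return 2.0 * math.pi * f_hz * v_peak / 1e6

f_i = 30e3              # input frequency in Hz
v_peak_out = 6.0 * 1.0  # ideal closed loop gain times the input peak, in volts

sr_needed = required_slew_rate(f_i, v_peak_out)
print(f"required slew rate = {sr_needed:.2f} V/us")

# Data sheet slew rates in V/us (Table 1)
slew_rates = {"uA741": 0.5, "OP27": 2.8}
distorted = {name: sr < sr_needed for name, sr in slew_rates.items()}
for name, is_distorted in distorted.items():
    status = "slew-rate limited" if is_distorted else "sufficient"
    print(f"{name}: {slew_rates[name]} V/us, {status}")
```

The same function can be inverted to find the largest undistorted output amplitude at a given frequency, for example about 2.65 V at 30 kHz for a 0.5 V/µs device.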

6.1.5 Supply voltages 

Most linear integrated circuits, particularly op-amps, use one or more differential 
amplifier stages, and differential amplifiers require both a positive and a negative 
power supply for proper operation of the circuit. 

The supply voltages of the proposed bond graph model of an op-amp can be tested by 
applying an input signal $v_i(t) = 3\sin(2\pi f_i t)$ V with $f_i = 30$ kHz, $R = 1$ kΩ, $R_f = 5$ kΩ 
and supply voltages of $+12$ V and $-12$ V to the µA741 op-amp; the time response is shown 
in Fig. 16. 

The ideal closed loop gain is $(A_{CL})_I = 6$, so the ideal output signal would be 
$v_o(t) = 18\sin(2\pi f_i t)$ V. However, the supply voltages of the op-amp are $\pm 12$ V, which 
limits the output signal to this voltage range. 

In the next section, the op-amp integrator, one of the most interesting configurations, is 
designed on a bond graph model. 

Fig. 16. Time response with slew rate and supply voltages using the µA741 op-amp. 
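
The saturation imposed by the supply voltages can be illustrated with a simple clipping model. The sketch below is an idealisation and not the MTF-based mechanism of the bond graph model: it assumes an ideal gain of 6, hard clipping at ±12 V and no slew rate limitation, and uses NumPy:

```python
import numpy as np

f_i = 30e3                      # input frequency in Hz
v_supply = 12.0                 # magnitude of the supply voltages in volts
gain = 6.0                      # ideal closed loop gain

t = np.linspace(0.0, 1.0 / f_i, 1001)       # one period of the input
v_in = 3.0 * np.sin(2.0 * np.pi * f_i * t)  # input signal in volts
v_ideal = gain * v_in                       # 18 V peak if unconstrained
v_out = np.clip(v_ideal, -v_supply, v_supply)

# Fraction of the period during which the output sits on a supply rail
clipped_fraction = np.mean(np.abs(v_ideal) > v_supply)

print(f"ideal peak = {v_ideal.max():.1f} V, clipped peak = {v_out.max():.1f} V")
print(f"fraction of the period spent clipped = {clipped_fraction:.2f}")
```

In this idealised picture the output is pinned to a rail for a little more than half of each period; the response of Fig. 16 additionally shows the slew rate limitation of the µA741.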



6.2 The operational amplifier integrator 

An op-amp integrator simulates mathematical integration, which is basically a summing 
process that determines the total area under the curve of a function (Floyd & Buchla, 1999). 
A basic integrator on a BGD model is shown in Fig. 17. Notice that the feedback element is a 
capacitor that forms an RC circuit with the input resistor. 

Fig. 17. Bond graphs of an operational amplifier integrator. 

In order to determine the transfer function of the op-amp integrator based on the bond graph 
model of Fig. 17, the methodology given in Section 2 is applied. The key vectors are, 



x(t) 



?1.(0 


;x(t) = 


/,« 


;z(t) = 


« u (0 


L?.,(o_ 




U,W_ 




U.wJ 



"(0 = e,(f) 



D fc (0 = [e 4 (0 c,(0 e t (t) f u {t) eJt)] T 
D„„(t) = [f 4 (t) f 7 (t) f 9 (t) e n {t) f 15 (t)] T 



the constitutive relations are, 



F = diag 



M 



(56) 



(57) 



111 1 

L = diag < — , — , — , R, — 

\ R . R R. R, 



and the junction structure matrix is, 









-m 





-m 





1 

m, 
0-10 

1 
0^0 

1 
m. 



;s, 



-i 

—m 

1 



-K 



;s M 
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1 


-1 














—m 



S n =[0 -1 0];5 33 =1 
5 31 =["1 0];5 1I= 5 13 =0 



(58) 



(59)

From (8), (13), (57), (58) and (59), the A matrix is,
-1 



(R o +mlR L )(R + R.) — (R + R.)m 2 R L 

— (R +m 2 R)mKRR -^(A + mKRRR) 
C V " ' ' C,R, ' 



where $\Delta = R_o[R_L(R + R_i) + R R_i] + m_2^2 R R_L R_i$.
Substituting (13), (58) and (59) into (9), $B_p$ is determined by,



B 



R ,( R ° + < R l) 

— (mKRRR ) 



From (10), (13), (57), (58) and (59) the C matrix is, 



C 



— (R + R. ) R o R L — m 2 RR.R L 



Finally, substituting (13), (58) and (59) into (11), D is given by, 



(60) 



(61) 



(62) 



1 
D =—RRR 

" A " • L 
(63)

The transfer function of a system represented in state space can be calculated by,

$$G(s) = C_p (sI_n - A_p)^{-1} B_p + D_p \qquad (64)$$

Substituting (60), (61), (62) and (63) into (64), the transfer function of the op-amp integrator
of Fig. 17 is,



G(s) 



(s 2 CCRR +sCRm 2 -Kmrn^R R 

\ h h o o \ 1 2 / L i 



2 CCRA + s(CRA + CRR RKmm +Cm 2 A] + m 2 K 

h h \ h h L i 1 2 1/ 1 

where A = (R + R. ) (.K + m]R L ) . 

Considering normal operation of the op-amp, $m_1 = m_2 = 1$ and R = , we have,



(65) 



GAs) 



2 CCRR 

s "- J ^ + s 

K 



CA 



K 



( R 

— + 



CR 



CR 
K 



1 
K 



— + 1 



(66)

In addition, taking the limit of (66) as $K \to \infty$, the ideal transfer function is determined by,

$$G(s)\big|_{K \to \infty} = -\frac{1}{sRC} \qquad (67)$$



Note that (67) is the typical transfer function of an integral controller, whereas (65) is the
transfer function of an op-amp-based integral controller that takes into account the internal
parameters of the op-amp and the external elements.
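As a rough numerical illustration of the ideal integrator $-1/(sRC)$, the sketch below evaluates its frequency response and integrates a sine input with the trapezoidal rule. It assumes R = 10 Ω, C = 100 µF and a 1 kHz, 1 V sine input, the values used for the time response in this section, and it models none of the internal op-amp parameters. The function names are hypothetical.

```python
import cmath
import math

R = 10.0     # input resistance in ohms
C = 100e-6   # feedback capacitance in farads
F = 1e3      # input frequency in Hz


def ideal_integrator_response(freq, r=R, c=C):
    """Complex gain of the ideal integrator -1/(sRC) at s = j*2*pi*freq."""
    s = 1j * 2.0 * math.pi * freq
    return -1.0 / (s * r * c)


def integrate_sine(freq, r=R, c=C, cycles=1, steps=2000):
    """Trapezoidal evaluation of v_o(t) = -(1/RC) * integral of sin(2*pi*freq*t)."""
    dt = cycles / (freq * steps)
    v_out = 0.0
    trace = [v_out]
    previous = 0.0  # sin(0)
    for k in range(1, steps + 1):
        current = math.sin(2.0 * math.pi * freq * k * dt)
        v_out -= 0.5 * (previous + current) * dt / (r * c)
        trace.append(v_out)
        previous = current
    return trace


gain = ideal_integrator_response(F)
trace = integrate_sine(F)

print(f"|G| at 1 kHz = {abs(gain):.4f}")
print(f"phase at 1 kHz = {math.degrees(cmath.phase(gain)):.1f} deg")
print(f"most negative output = {min(trace):.4f} V")
```

With zero initial charge on the capacitor, the output swings between 0 and $-2/(\omega RC)$, so its peak-to-peak value is twice the gain magnitude at the input frequency.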
The time response of the integrator circuit using the µA741 op-amp with R = 10 Ω, C = 100 µF and an input $v_i(t) = \sin(2\pi f t)$ V, where f = 1 kHz, is shown in Fig. 18. The output signal confirms that the bond graph model of Fig. 17 is an integrator of the input signal.

Fig. 18. Time response of the integrator.

The frequency response of the integrator circuit of Fig. 17 is shown in Fig. 19. Note that (67)
is represented by Fig. 19.

Fig. 19. Frequency response of the integrator circuit.

In the next section, an important application of the op-amp, the active filter, is proposed in the physical domain.



7. Active filters using bond graph models

An electric filter is often a frequency selective circuit that passes a specified band of 
frequencies and blocks or attenuates signals of frequencies outside this band. Filters may be 
classified in a number of ways (Gayakward, 2000): 

1. Analog or digital. Analog filters are designed to process analog signals, while digital 
filters process analog signals using digital techniques. 

2. Passive or active. Passive filters use resistors, capacitors and inductors in their
construction, while active filters employ transistors or op-amps in addition to resistors
and capacitors.

An active filter offers the following advantages over a passive filter: a) Gain and frequency 
adjustment flexibility, b) No loading problem, c) Cost. 

A filter will usually conform to one of four basic response types: low-pass, high-pass, band- 
pass and band-reject. 

7.1 Low-pass filter 

A low-pass filter allows only low-frequency signals to pass through, while suppressing
high-frequency components. The bond graph model of a low-pass filter is shown in Fig. 20.

Fig. 20. Bond graph model of a low-pass filter.

The key vectors of the bond graph of Fig. 20 are,



x(t) 



9„(f) 


;x(t) =

The constitutive relations of the elements are,

and the junction structure is

(72)



From (8), (13), (69), (70) and (71), the A matrix of the low-pass filter is,

RC AC,



where $\Delta = R_o[(R_L + R_f)(R + R_i) + R R_i] + m_2^2[R R_L(R_f + R_i) + R_L R_f R_i]$
Substituting (70) and (71) into (13), the B matrix is obtained

(74)



Finally, from (10), (13), (69), (70) and (71), the $C_p$ matrix is

C =

p 



m ? R r r- / v -, RR.R 

-^- L \R(R.+R r ) + R.R f \ — — 

AC, L V ' ' } fJ AC,, 



(75) 



and $D_p = 0$.

From (73) to (75) and (64), and considering the ideal characteristics of the op-amp, $R_o = 0$, $R_i = \infty$, $K = \infty$ and $R_L = \infty$, the ideal transfer function of the low-pass filter, denoted by $G(s)$, is obtained:

$$G(s) = \frac{A_{CL}/(R_f C)}{s + 1/(R_f C)}, \qquad \text{where } A_{CL} = 1 + \frac{R_f}{R}$$

The ideal transfer function of the low-pass filter indicates that in the pass band the gain is almost $A_{CL}$ and that the pole of the system is located at the high cutoff frequency $f_h$, defined by

$$f_h = \frac{1}{2\pi C R_f}$$

The frequency response of the low-pass filter based on the bond graph model, using the µA741 op-amp and the numerical values of Table 3, is shown in Fig. 21.

Table 3. Numerical values of the low-pass filter.

Fig. 21. Frequency response of the low-pass filter.

Fig. 21 shows that $f_h = 1$ kHz. Moreover, the filter is of first order, because it has a single pole and its roll-off rate is 20 dB per decade.
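The two properties read from Fig. 21, the cutoff frequency and the 20 dB per decade roll-off, can be checked against an ideal single-pole response. The sketch below assumes a response of the form $A_{CL}/(1 + jf/f_h)$ with $f_h = 1/(2\pi RC)$ for the filter resistance and capacitance. The component values and the gain are hypothetical, chosen only so that $f_h$ is close to 1 kHz, because the values of Table 3 are not available here.

```python
import math


def cutoff_frequency(resistance, capacitance):
    """Cutoff frequency f_h = 1 / (2*pi*R*C) of a single-pole RC stage, in Hz."""
    return 1.0 / (2.0 * math.pi * resistance * capacitance)


def lowpass_gain_db(freq, f_h, a_cl):
    """Gain in dB of the ideal first-order response A_CL / (1 + j*freq/f_h)."""
    magnitude = a_cl / math.sqrt(1.0 + (freq / f_h) ** 2)
    return 20.0 * math.log10(magnitude)


# Hypothetical values, not taken from Table 3.
R_FILTER = 15.9e3    # ohms
C_FILTER = 0.01e-6   # farads
A_CL = 2.0           # assumed pass-band gain

f_h = cutoff_frequency(R_FILTER, C_FILTER)
drop_at_cutoff = lowpass_gain_db(f_h, f_h, A_CL) - lowpass_gain_db(0.0, f_h, A_CL)
rolloff_per_decade = (lowpass_gain_db(1000.0 * f_h, f_h, A_CL)
                      - lowpass_gain_db(100.0 * f_h, f_h, A_CL))

print(f"f_h = {f_h:.1f} Hz")
print(f"gain change at f_h = {drop_at_cutoff:.2f} dB")
print(f"roll-off well above f_h = {rolloff_per_decade:.2f} dB per decade")
```

The gain is 3 dB below its pass-band value at $f_h$ and falls by very nearly 20 dB for every tenfold increase in frequency above it, which is the signature of a single pole.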



7.2 High-pass filter 

A high-pass filter allows only frequencies above a certain break point to pass through. In
other words, it attenuates low-frequency components. A first-order high-pass filter on a bond
graph model is shown in Fig. 22.

Fig. 22. Bond graph model of a high-pass filter.

The mathematical model of the filter can be determined in the same manner as for the low-pass filter. However, for all the remaining filters of this paper, we use the software 20-sim to obtain the performance of the filter in the physical domain, considering the internal characteristics of the op-amp and the external components connected to the op-amp that define the filter type.

Using the same numerical values as for the low-pass filter, the frequency response of the filter is shown in Fig. 23.

Fig. 23. Frequency response of the high-pass filter.

Fig. 23 indicates that we have a first-order filter and that the low cutoff frequency is 1 kHz.



7.3 Band-pass filter 

The band-pass filter can be thought of as a combination of high-pass and low-pass filters. It allows only frequencies within a specified range to pass through. In Fig. 24 a band-pass filter on a bond graph model is proposed. This filter is a narrow band-pass filter using multiple feedback.

In order to obtain the performance of this filter, we use the internal parameters of the µA741 op-amp given in Tables 1 and 2 and the external components given in Table 4.

Fig. 24. Bond graph model of a band-pass filter.

Table 4. Numerical values of the external components

The frequency response of the band-pass filter is shown in Fig. 25.

Fig. 25. Frequency response of the band-pass filter.

Note that the filter passes the 1 kHz frequency component with a closed-loop gain of 10, while the frequencies below and above 1 kHz have a small magnitude.



7.4 Band-reject filter 

The band-reject filter allows everything to pass through with the exception of a specific range of frequencies. The band-reject filter is often called a notch filter, because it is commonly used for the rejection of a single frequency. Fig. 26 shows this filter on a bond graph model.

Using the parameters of the µA741 op-amp given in Tables 1 and 2 and the external elements of the op-amp of Fig. 26 given in Table 5, the frequency response of the filter is shown in Fig. 27.

Fig. 26. Bond graph model of a band-reject filter.

Table 5. External elements of the band-reject filter.

Fig. 27. Frequency response of the band-reject filter.

Fig. 27 shows that the 60 Hz power line frequency can be attenuated using this filter in the physical domain.



7.5 Applying active filters 

A filter is a circuit that inhibits the transfer of a specific range of frequencies, so active filters can be applied to select a frequency component of a complex signal. An interesting application of filters on bond graph models is shown in Fig. 28, where the input signal is given by,

$$v_i = 0.1\sin[2\pi(10\,\mathrm{Hz})t] + 0.1\sin[2\pi(500\,\mathrm{Hz})t] + 0.1\sin[2\pi(20\,\mathrm{kHz})t] \qquad (76)$$

In Fig. 28, the input is applied to a low-pass filter denoted by opamp1, to a high-pass filter denoted by opamp2 and to a band-pass filter denoted by opamp3.

Also, the external elements of Fig. 28 are given in Table 6 and, using the parameters of the µA741 op-amp given in Tables 1 and 2, the frequency responses are shown in Fig. 29.

Fig. 28. Composite signal applied to bond graph filters.

Table 6. External elements of the composite system.

In Fig. 29, the high cutoff frequency is 10 Hz for the low-pass filter, the low cutoff frequency is 20 kHz for the high-pass filter and the center frequency is 500 Hz for the band-pass filter.

Finally, the time response of each filter is shown in Fig. 30, and we note that the filters achieve their objective.

Fig. 29. Frequency response of a complete system.

Fig. 30. Time response of the complete system.

Note that the bond graph model of the op-amp can be applied to design controllers, analog computers and general linear applications.
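The separation of the three components of the composite signal of Section 7.5 can be illustrated with ideal filter magnitudes. The sketch below is an assumption-laden stand-in for the bond graph simulation: it uses first-order low-pass and high-pass responses, a second-order band-pass response with a hypothetical quality factor of 5, and unity pass-band gain for all three filters. Only the frequencies (10 Hz, 500 Hz, 20 kHz) and the 0.1 amplitude come from the text.

```python
import math


def lowpass_gain(freq, f_c):
    """Magnitude of an ideal first-order low-pass filter with cutoff f_c."""
    return 1.0 / math.sqrt(1.0 + (freq / f_c) ** 2)


def highpass_gain(freq, f_c):
    """Magnitude of an ideal first-order high-pass filter with cutoff f_c."""
    ratio = freq / f_c
    return ratio / math.sqrt(1.0 + ratio ** 2)


def bandpass_gain(freq, f_0, q):
    """Magnitude of an ideal second-order band-pass filter centered at f_0."""
    return 1.0 / math.sqrt(1.0 + (q * (freq / f_0 - f_0 / freq)) ** 2)


COMPONENTS_HZ = (10.0, 500.0, 20e3)  # frequencies of the composite input
AMPLITUDE = 0.1                      # amplitude of each component
Q_ASSUMED = 5.0                      # hypothetical band-pass quality factor

filters = {
    "opamp1 (low-pass, 10 Hz)": lambda f: lowpass_gain(f, 10.0),
    "opamp2 (high-pass, 20 kHz)": lambda f: highpass_gain(f, 20e3),
    "opamp3 (band-pass, 500 Hz)": lambda f: bandpass_gain(f, 500.0, Q_ASSUMED),
}

# Output amplitude of every input component at the output of every filter.
output_amplitudes = {
    name: [AMPLITUDE * gain(f) for f in COMPONENTS_HZ]
    for name, gain in filters.items()
}
dominant = {
    name: COMPONENTS_HZ[amps.index(max(amps))]
    for name, amps in output_amplitudes.items()
}

for name, amps in output_amplitudes.items():
    formatted = ", ".join(f"{a:.5f}" for a in amps)
    print(f"{name}: [{formatted}] -> dominant {dominant[name]:g} Hz")
```

In this idealized setting each filter output is dominated by the component it is meant to select, and the other two components are attenuated by more than an order of magnitude.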



8. Controller based on bond graph model of an operational amplifier 

Another important application of operational amplifiers is the implementation of controllers, which can be used to improve the performance of a closed-loop system. Here, a proportional and integral (PI) controller designed in the physical domain is applied to control the velocity of a DC motor (Barna & Porat, 1989).

The noninverting and integrator configurations of an op-amp are used to derive the PI controller. Also, the input reference of the system is a voltage, and a GY element is applied on the feedback path to connect the output with the summing junction. The BGI of the closed-loop system is shown in Fig. 31.

Fig. 31. PI controller of a DC motor on a bond graph model.

The BGI of Fig. 31 can be used to determine the state space representation or the transfer function of the closed-loop system. However, in this case we only show the simulation, in order to test the PI controller connected to the DC motor in a closed-loop system.

Fig. 32. Time response of the output of the DC motor using a PI controller.

The time response of the output using a µA741 op-amp, resistances of 10 kΩ and 90 kΩ, $R_a = 2\ \Omega$, $L_a$ = OMmH, $J = 0.001$ kg m² and $b = 1$ N m/(rad/s) is shown in Fig. 32.

Note that the PI controller yields a steady-state output equal to the input reference of the system, and that the transient response depends on the parameters of the system and the controller.
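The steady-state behavior noted above can be reproduced with a very simple numerical sketch. The code below is not the bond graph model of Fig. 31: it uses a textbook linear DC motor model with an ideal PI law and forward-Euler integration. Only $R_a = 2\ \Omega$, $J = 0.001$ kg m² and $b = 1$ N m/(rad/s) are taken from the text. The armature inductance, the motor constant, the PI gains and the unity feedback gain are assumptions made purely for illustration.

```python
def simulate_pi_dc_motor(reference=1.0, kp=10.0, ki=1000.0,
                         r_a=2.0, l_a=1e-3, j=1e-3, b=1.0, k_m=1.0,
                         t_end=0.2, dt=1e-5):
    """Forward-Euler simulation of a PI velocity loop around a DC motor.

    Assumed model (not taken from the chapter):
        l_a * di/dt = v - r_a * i - k_m * omega
        j * domega/dt = k_m * i - b * omega
        v = kp * error + ki * integral(error), error = reference - omega
    """
    current = 0.0
    omega = 0.0
    integral = 0.0
    history = []
    steps = int(round(t_end / dt))
    for _ in range(steps):
        error = reference - omega
        integral += error * dt
        voltage = kp * error + ki * integral
        d_current = (voltage - r_a * current - k_m * omega) / l_a
        d_omega = (k_m * current - b * omega) / j
        current += d_current * dt
        omega += d_omega * dt
        history.append(omega)
    return history


history = simulate_pi_dc_motor()
print(f"output after 0.2 s = {history[-1]:.6f} (reference = 1.0)")
```

Because of the integral action, the simulated velocity settles at the reference value. Changing the assumed gains or motor parameters alters the transient but not this steady-state value, as long as the loop remains stable.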

9. Conclusions 

In this work, a bond graph model of an operational amplifier has been presented. The proposed bond graph takes into account the input and output resistances, open-loop gain, supply voltages, slew rate and frequency compensation of an operational amplifier. Therefore, an advantage of this model is that the performance of applications based on operational amplifiers can be determined for a specific type of linear integrated circuit, whose internal parameters are obtained from the data sheets. Finally, open-loop and closed-loop configurations of the operational amplifier in the physical domain have been shown.

1. Introduction

Hypermobile robots belong to the group of hyper-redundant articulated mobile robots. This 
group can be further divided based on two characteristic features: the way the forward 
motion of the robot is generated and the activity of its joints, as shown in Table 1. 



Propulsion: external propulsion elements (legs, wheels, tracks)

Active joints - Hypermobile robots:

• Koryu-I and Koryu-II (Hirose, 1993)

• Snake 2 (Klaassen and Paap, 1999)

• Soryu (Takayama and Hirose, 2000)

• Millibot Train (Brown et al., 2002)

• Moira (Osuka & Kitajima, 2003)

• Pipeline Explorer (Schempf et al., 2003)

• Omnis family (Granosik et al., 2005)

• MAKRO plus (Streich & Adria, 2004)

• KOHGA (Kamegawa et al., 2004)

• JL-I (Zhang et al., 2006)

• Wheeeler (Pytasz & Granosik, 2006)

Passive joints - Active wheels - passive joints robots:

• Genbu 3 (Kimura & Hirose, 2002)

Propulsion: movement is generated by undulation

Active joints - Snake-like robots:

• The whole Active Cord Mechanism family of robots (Hirose, 1993)

• Slim Slime Robot (Ohno & Hirose, 2000)

• Snake robots by Dr. Gavin Miller (snakerobots.com)

• Perambulator-II (Ye et al., 2007)

Passive joints - none listed



Table 1. Articulated mobile robots

Articulated robots can also be divided in another way, as suggested by Robinson and Davies (1999), into three groups: discrete, serpentine and continuum. Most robots are discrete mechanisms constructed from a series of rigid links interconnected by discrete joints. In robotic manipulators, joints usually have one degree of freedom (DOF), but in articulated mobile robots 2-DOF and 3-DOF joints are more common. Multi-degree-of-freedom joints naturally create spatial devices, which can bend in any direction.
Serpentine robots also utilize discrete joints but combine very short rigid links with a large density of joints. This creates highly mobile mechanisms producing smooth curves, similar to a snake. Therefore, representatives of this group of robots are also called snake-like robots, as shown in Fig. 1. Some of the hypermobile robots feature very short links relative to the cross section size and also belong to this group.




Fig. 1. Active Cord Mechanism (left) reproduced from (Hirose, 1993), ACM-R3 reproduced 
from (Mori & Hirose, 2002) 

Continuum robots do not contain rigid links and identifiable rotational joints. Instead, the structures bend continuously along their length via elastic deformation and produce motion through the generation of smooth curves, similar to tentacles. A good example of a continuum mobile robot is the Slim Slime Robot (see Fig. 2).




Fig. 2. Slim Slime Robot reproduced from (Ohno & Hirose, 2000) 

In the next part of this paper we present a few major hypermobile robot projects from around the world, focusing on a comparison of design concepts, main features, performance, and teleoperation or automated operation techniques. Finally, we present our own hypermobile robot, Wheeeler, showing its design concept, model, simulation results, construction and prototype tests. In the conclusion we summarize this digest with a table showing the basic properties of the presented robots. We also show the advantages and disadvantages of our design and our future plans. We hope to convey some general ideas on the design, construction and control of such complex devices as hypermobile robots with many redundant degrees of freedom.



2. History of hypermobile robots 

One of the scenarios where hypermobile robots could play the main role is search and rescue. The intensified work in this field was related to the large catastrophes that happened in different countries: the Kobe earthquake, the terrorist attacks on the World Trade Center in New York and the bomb attacks on trains in London and Madrid. In this application, robots are intended to slither into the debris and gather visual information about possible victims. Therefore, the robot is expected to fit through small openings, travel through the rubble of a structured environment and overcome obstacles. The other important application for hypermobile robots is inspection tasks in sewage systems, gas pipes or venting systems.
The first practical realization of a hypermobile robot, called Koryu or KR-I, was introduced by Hirose and Morishima (1990) and later improved with version KR-II (Hirose et al., 1991), as shown in Fig. 3. KR-II was developed on the premise that it would be applied as a mobile robot for atomic reactors. It was also considered as a substitute for firemen in rescue activities: patrolling, gas detection, inspection and rescuing people. This first hypermobile robot was large and heavy, weighing in at 350 kg. The robot comprised multiple vertical cylindrical segments on powered wheels (tracks in KR-I) that gave the mechanism a train-like appearance. Vertical joint actuators allow a segment to lift its neighbors up in order to negotiate steps or span gaps. Each segment of KR-II is equipped with a single wheel, arranged so that a unit with the wheel on the right side comes after a unit with the wheel on the left side. This single-wheel design may seem unbalanced at first glance, but its stability is secured because the segments are linked, especially if the vehicle has a zigzag configuration. Moreover, this single-wheel design has other advantages:

• as each segment is connected to the body by a 2-DOF joint, it may be seen as having a sliding active suspension,

• adaptation to a steep inclination during traversing can be realized by shifting all wheels on one side up or down in the vertical direction,

• in addition, this design does not require the differential mechanism of the double-wheel structure to permit different rotation speeds on curves.




Fig. 3. The Koryu-I (KR-I) robot on the left and KR-II on the right (reproduced from 
http://www-robot.mes.titech.ac.jp) 

These robots inherited all the capabilities of the earlier developed snake-like robots:

• They can travel over irregular terrain with sharp rises and falls and follow a path that winds tightly,

• They can cross over crevasses by holding their body rigid to act as a bridge,

• In marshy and sandy terrain, they can move by distributing force along their entire body length.

Additional active crawlers or wheels mounted on each segment give further advantages:

• High-speed motion - direct propulsion is more effective than undulation,

• High load capacity - a simple driving system gives a large ratio of load weight to own weight,

• Good portability thanks to the unitized structure,

• High reliability, because the robot is redundant - broken segments can be easily replaced and special segments can be added depending on the mission,

• Versatility of the body motion - Koryu can be used not only for "locomotion" but also for "manipulation", as the authors claim (Hirose & Morishima, 1993).

For motion control of a robot with several wheels touching the ground, force sensors that detect the reaction force were indispensable. To solve this problem, a special optical force sensor construction was introduced. Such sensors were mounted on both the vertical and horizontal axes to control the robot among obstacles and on uneven terrain.

More recently, Klaassen and Paap (1999) at the German National Research Center for 
Information Technology (GMD) developed the Snake2 vehicle, which contains six active 
segments and a head, as shown in Fig. 4. Each round segment has an array of 12 electrically 
driven wheels evenly spaced around its periphery. These wheels provide propulsion 
regardless of the vehicle's roll angle. Segments are interconnected by universal joints 
actuated by three additional electric motors through strings. Snake2 is an example of a robot 
that is inspired by the physiological structure of snakes where wheels replace tiny scales 
observed on the bodies of some real snakes. Snake2 is equipped with six infrared distance 
sensors, three torque sensors, one tilt sensor, two angle sensors in every segment, and a 
video camera in the head segment. Snake2 was specifically designed for the inspection of 
sewage pipes. With segments measuring 18 cm in diameter and 13.5 cm in length, Snake2
belongs to the serpentine group.




Fig. 4. Snake2 developed at the GMD 

This robot was a successor of the GMD-Snake, a typical continuum spatial robot built in a very elastic way to allow flexible bending of its parts (Worst & Linnemann, 1996). However, the authors observed an uncontrolled torsion effect, which occurred when the snake lifted some of its parts (to climb a step). This disadvantage forced them to construct the next generation in a more rigid way, using universal joints but keeping the rope-based driving system. The GMD snakes were the first articulated robots employing CAN bus communication in the distributed control system.

Another hypermobile robot designed for sewer inspection was developed by Scholl et al. 
(2000) at the Forschungszentrum Informatik (FZI) in Germany. Its segments use only two 
wheels but the actuated 3-DOF joints allow full control over each segment's spatial 
orientation. The robot is able to negotiate tight 90° angled pipes and climb over 35 cm high 
obstacles. One segment and its joint are about 20 cm long each. The sensor suite of this robot 
is similar to that of Snake2. The development of sewer inspection robots was continued in 
the joint project MAKRO plus (Streich & Adria, 2004). 

MAKRO plus is an autonomous service robot that can be used for a whole range of specific duties within a canalization system. The robot has a symmetrical construction with head segments on both ends. These segments contain a camera, a structured light source and an ultrasound sensor. A four-level hierarchical control system is proposed to drive the robot autonomously inside sewage pipes. The robot's mission is specified by a human operator, who determines the entry and recovery points and downloads a map of all pipes and manholes in the inspection area. Then the planning algorithm generates a sequence of actions, which are executed by the action controller. In case of obstacle detection, blockage or malfunction, the planner automatically finds a new set of actions. The robot can be equipped with specialized modules. A chemistry module measures pH levels, conductivity, O2 and temperature of the waste water with the help of a sample probe. When required, samples can be retrieved by the robot for further analysis in a laboratory. A navigation module records speed, and its three fiber-optic gyroscopes measure the gradient and direction of the pipes in a canalization system. This supports the success of the mission and also provides a useful and accurate update for the land registry records, according to INSPECTOR SYSTEMS Rainer Hitzel GmbH.




Fig. 5. MAKRO plus robot for sewer inspection (reproduced from Streich & Adria, 2004) 

While wheeled serpentine robots can work well in smooth-walled pipes, more rugged 
terrain requires tracked propulsion. To this effect Takayama and Hirose (2000) developed 
the Souryu-I crawler, which consists of three segments, as shown in Fig. 6. 




Fig. 6. Souryu from Hirose Lab (reproduced from http://www-robot.mes.titech.ac.jp) 

Each segment is driven by a pair of tracks, which, in turn, are all powered simultaneously by a single motor located in the center segment. Torque is provided to the two distal segments through a rotary shaft and universal joints. Each distal segment is connected to the center segment by a special 2-DOF joint mechanism, which is actuated by two lead screws driven by two electric motors. The robot can move forward and backward, and it can change the orientation of the two distal segments in yaw and pitch symmetrically to the center segment. Coordinated rotations of these joints can generate a roll-over motion of the robot. One interesting feature is the ability of this robot to adapt to irregular terrain because of the elasticity of its joints. This elasticity is provided by springs and cannot be actively controlled. The newest incarnation, Souryu-II, is designed so that its three bodies can be separated easily, which makes it portable and makes it possible to add segments with special functions. The robot is equipped with a camera and batteries and can be remotely controlled.
A different concept using unpowered joints was introduced by Kimura and Hirose (2002) at the Tokyo Institute of Technology. That robot, called Genbu (see Fig. 7), is probably the only serpentine robot with unpowered joints. The stability of the robot and its high mobility on rough terrain are preserved by large-diameter wheels (220 mm). The control system employs position and torque feedback sensors for the passive but rigid joints. Springs are used to protect the electric motors from impact, although the stiffness of the springs cannot be controlled during operation. The robot was intended mainly for two applications: as a fire-fighting robot to pull a fire hose or as a planetary rover.




Fig. 7. Robot Genbu representing group of active wheels - passive joints robots 

Another robot incorporating a combination of passive and active joints as well as independently driven and coupled segments is KOHGA, developed by Kamegawa et al. (2004) and shown in Fig. 8. The robot comprises 8 segments of different structure and function: the two distal segments carry CCD cameras but have no means of propulsion, the second units have right and left crawlers which are driven co-accessibly, and the other segments also have right and left crawlers, but these are driven independently. There is also a variety of joints implemented in this design:

• Two 2-DOF joints driven by simple RC servos to control the position of the heads with cameras,

• Two 2-DOF joints with powerful DC motors and linkages to raise two segments on either end, which improves the capability of climbing over obstacles,

• Three 3-DOF passive joints interconnecting the main driving units; their function is to adjust the robot's shape to the environment and efficiently transmit the crawler force, and they are passive for light weight and simplicity.




Fig. 8. Robot KOHGA and KOHGA 2 (in a few configurations) from Matsuno Lab. at the University of Electro-Communications

This robot implements a smart design feature: besides a camera in the front segment, there is a second camera in the tail section that can be pointed forward, in the way a scorpion points its tail forward and overhead. This "tail-view" greatly helps in teleoperating the robot. The operator uses a SONY gamepad as the input device and a monitor with specially organized video outputs. The authors also proposed an algorithm (based on robot kinematics only) to calculate the speed of the tracks and the rotation of the joints to realize follow-the-leader control of the robot.
KOHGA, with its passive joints, has an important problem: obstacles can get caught in the joints, leaving the robot stuck. To solve this problem, the new reconfigurable version, KOHGA 2, was developed (Miyanaka et al. 2007). The unit structure consists of the crawler-arm-units, the joint-units, the terminal-units and the connecting parts. A unit can work as a self-contained module or can be connected with other units to create a multi-segmented vehicle. Moreover, it can take various forms by the swing motion of the crawler-arms and thereby avoid various stuck conditions. These crawler-arms can be mounted in two ways: the rotational axes of the right and left crawler-arms are alternately attached to the vehicle (called the non-coaxial type), or these axes are attached to the same end of the vehicle (called the coaxial type). The authors have considered several robot configurations, in different stuck-prone conditions, in both high- and low-ceiling environments. They concluded that (1) the ability to avoid getting stuck declines if the number of connected vehicles is small, because the performance of vertical step climbing falls off, and (2) the coaxial type robot is more effective at avoiding getting stuck than the non-coaxial type robot.

The concept of joining several small robots into a train to overcome larger obstacles was 
used by researchers from Carnegie Mellon University in their Millibot Train (Brown et al., 
2002). This robot consists of seven electrically driven, very compact segments. The diameter 
of the track sprockets is larger than the height of each segment, which allows the robot to 
drive upside-down. Segments are connected by couplers for active connection and 
disconnection, but the joints have only one DOF. Each joint is actuated by an electric motor 
with a high-ratio harmonic gear and slip clutch. It provides sufficient torque to lift up the 
three front segments. The robot has been demonstrated to climb up a regular staircase and 
even higher steps. However, with only one DOF in each joint the vehicle is kinematically 
limited. 

A serpentine robot that uses tracks for propulsion and pneumatics for joint actuation is MOIRA, shown in Fig. 9 (Osuka & Kitajima, 2003). MOIRA comprises four segments, and each segment has two longitudinal tracks on each of its four sides, for a total of eight tracks per segment. All tracks are driven from a single motor through a system of 4 bevel and 4 spiral gears, and therefore they move in the same direction. With tracks on each side, the robot is insensitive to rollovers, and with its additional cone-shaped distal segments it can dig into debris with obstacles touching it from all sides. The 2-DOF joints between segments are actuated by pneumatic cylinders. With the pneumatic cylinders MOIRA can lift its segments high enough to overcome obstacles, and it can also decrease the stiffness of the actuators to conform nicely to the ground. However, we think that with very long joints this design is prone to getting stuck on some narrow obstacles. The robot is controlled from a specially designed control box containing 3 joysticks and several switches. There is also a view from the nose CCD camera transmitted via USB.

Fig. 9. Robot Moira from Osuka Lab.

The newest construction from NREC (National Robotics Engineering Center) is Pipeline Explorer, a robot designed and built for the inspection of live gas pipelines (Schempf et al., 2003). This robot, shown in Fig. 10, has a symmetric architecture. A seven-element articulated body design houses a mirror-image arrangement of locomotor (camera) modules, battery-carrying modules and support modules, with a computing and electronics module in the middle. The robot's computer and electronics are protected in purged and pressurized housings. Segments are connected with articulated joints: the locomotor modules are connected to their neighbors with pitch-roll joints, while the others are connected via pitch-only joints. These specially designed joints allow orientation of the robot within the pipe in any direction needed.




Fig. 10. Pipeline Explorer from NREC (reproduced from 
http://www.rec.ri.cmu.edu/projects/explorer) 

The locomotor module houses a mini fish-eye camera, along with its lens and lighting 
elements. The camera has a 190-degree field of view and provides high-resolution color 
images of the pipe's interior. The locomotor module also houses dual drive actuators 
designed to allow for the deployment and retraction of three legs equipped with custom- 
molded driving wheels. The robot can sustain speeds of up to four inches per second. It is 
fully untethered (battery-powered, wirelessly controlled) and can be used in explosive 
underground natural gas distribution pipelines. The construction of the robot naturally
limits its application to pipes of certain diameters.

From 2002 to 2005 researchers from the Mobile Robotics Lab at the University of Michigan
introduced a whole family of hypermobile robots called Omnis, shown in Fig. 11. In the
OmniPede, the first one, they introduced three innovative functional elements: (1)
propulsion elements (here: legs) evenly located around the perimeter of each segment; (2)
pneumatic power for joint actuation; and (3) a single so-called "drive shaft spine" that
transfers mechanical power to all segments from a single drive motor (Long et al., 2002).
From the study of the OmniPede, and from the observed shortcomings of this legged
propulsion prototype, they derived important insights about the design of serpentine
robots. These insights led to the development of the far more practical "OmniTread"
serpentine robot (Granosik et al., 2005). The OmniTread design offers two fundamentally
important advantages over its predecessor and, in fact, over all other serpentine robots
described in the scientific literature to date: maximal coverage of all sides of all segments
with propulsion elements, and joint actuation with pneumatic bellows. We
believe that the bellows-based joint actuators used in OmniTread have a substantial
advantage over a cylinder-based design, as discussed in Granosik & Borenstein (2005).
This robot passed extended tests at the Southwest Research Institute in Texas, showing
excellent performance on the sand and rock testbeds as well as in the underbrush. It can climb
obstacles 2.5 times higher than itself and span trenches almost half of its own length. The latest
version of the OmniTread is called OT-4, as it can fit through a hole 4 inches (10 cm) in
diameter (Borenstein et al., 2006). The OT-4 is even more versatile than its predecessors:
with onboard power sources (both electric and pneumatic) it can operate for up to one hour,
with wireless communication it is completely tetherless, with clutches it can precisely control
power consumption, and with additional flipper-tracks it can easily overcome the knife-edge
hole obstacle and climb almost 5 times its own height. Detailed information on the
performance of all members of the Omnis family can be found in (Granosik et al., 2007).




Fig. 11. The Omnis family of hypermobile robots from University of Michigan: OmniPede 
(upper left), OmniTread (lower left), OT-4 (right) 

Another example of a reconfigurable hypermobile robot was developed by Zhang et al.
(2006). The JL-I system, shown in Fig. 12, consists of three identical modules; each
module is in fact an entire robotic system that can perform distributed activities. The vehicles
have the form of crawlers with skid-steering ability. To achieve highly adaptive locomotion
capabilities, the robot's serial and parallel mechanisms form an active joint, enabling it to
change its shape in three dimensions. A docking mechanism enables adjacent modules to
connect or disconnect flexibly and automatically. This mechanical structure and the control
system are intended to ensure optimal traction for the assembled robot. Each module is an
autonomous mobile robot capable of performing basic tasks such as search and navigation.
To achieve all these functions, the control system of the robot is based on a distributed
architecture with a wireless connection to the base station. This flexible system, with several
identical modules which can work separately or simultaneously when assembled, required
hierarchical software based on the multi-agent behavior-based concept. The robot showed
the ability to climb steps, span gaps and recover from any rollover situation.

Fig. 12. Reconfigurable robot JL-I developed by Zhang et al. (2006)

3. Wheeeler project 

Our project is focused on precise modeling and simulation of a hypermobile robot in a testing
environment, and eventually on building it in as simple a way as possible to verify a high-level
control concept. As we observed from the literature review, most of the hypermobile robots
presented to date lack autonomy or intuitive teleoperation, or their autonomy is limited
to a very specific environment of operation. Although every robot has some control system,
in most cases they employ multi-DOF joysticks (Osuka & Kitajima, 2003) or
sophisticated user interfaces (Baker & Borenstein, 2006), or require more than one operator.
Our goal is to simplify teleoperation of these robots and increase their applicability. We
consider an articulated mobile robot propelled on wheels, and therefore called Wheeeler. We
start with precise modeling of Wheeeler and designing the most intuitive user interface to
control it. Then we show some mechanical details of the suspension system and the
proof-of-concept prototype containing 3 identical segments, built with many off-the-shelf
components used in RC model technology.

3.1 Modeling and simulation 

At this stage of the project we have used the Webots PRO simulation software to model the
robot and its working environment (Michel, 2004). Applying masses, inertias, friction
coefficients and damping made the model and simulation very realistic (see Fig. 13). Webots
relies on ODE (Open Dynamics Engine) to perform accurate physics simulation and uses
OpenGL graphics.

We assumed that the robot will contain 7 identical segments interconnected by 2-DOF joints
allowing pitch and yaw rotations. Each segment has its own drive and a suite of sensors
including 4 distance sensors facing up, down, left and right, an encoder on the main motor,
a 3-axis accelerometer and a single-axis gyro. We also assumed position feedback from the
joints and vision feedback from two cameras mounted on both ends of the robot. With these
two cameras the robot will have advantages similar to the Kohga robot, providing the operator
with a view from the nose camera and a perspective view from behind and above the robot
when the tail of Wheeeler is lifted in a scorpion-like manner.

Fig. 13. Model of Wheeeler in Webots PRO simulator (Pytasz & Granosik, 2006)

In the simulation environment we have access to information from all sensors. This data is
processed by the robot controller and streamed to the client-operator. The CORBA framework
has been used as the communication layer. The choice was made because of its portability and
flexibility; a detailed explanation can be found in (Pytasz & Granosik 2007). With the
development of the robot and the extension of its sensory suite, a larger amount of data had
to be transferred over the network, including:

• control commands to robot, 

• sensor data from robot, 

• video streaming. 

The selected mechanism allows for easy extension of communication features and decreases
the probability of programming errors. The same data structures and transport
mechanisms as used in simulation will be verified on the real robot.

Inter-segment joints working in the vertical direction have a range of movement close to ±90°;
in the horizontal direction it is a little over ±45°. These ranges, combined with short segments
and a zigzag posture of the robot (e.g. as shown in Fig. 14), can compensate for the lack of
all-side tracks (known from Moira or OmniTread). When the rotation of the upper wheels is
opposite to that of the lower wheels, the robot is able to enter pipes, shafts, or low-ceiling
environments.




Fig. 14. Zigzag configuration of Wheeeler for working in pipes, shafts and low ceiling
environments.

3.2 Mechanical concept

In designing Wheeeler we tried to take the best features of other hypermobile robots, but we
also bore in mind that the mechanical construction has to be simple. Impressed with the
behavior of Genbu (even with passive joints this wheeled robot can evade stuck situations),
and envisaging an easy method of driving two wheels on a common axle, we decided to use
light-weight plastic wheels known from Monster Truck models (see Fig. 15). With big
wheels, larger than the segment's interior, the robot can easily ride upside down. The driving
module uses a single RC servo motor and a bevel gear to transmit rotation directly to both
wheels. We decided not to use a differential mechanism, to reduce weight and simplify the
transmission system. Moreover, for off-road vehicles it is much better to have the same
speed and torque on both wheels to ensure grip on rocks or sand, even though skidding will
appear on curves.




Fig. 15. Proof-of-concept prototype of Wheeeler - 3 segments (out of planned 7 segments)
connected with 2-DOF joints

Hypermobile robots should conform to the terrain compliantly, so that as many driving
segments as possible are in contact with the ground at all times to provide effective
propulsion. Based on the literature review in the previous section, and as shown in Table 4,
there are in general three methods of adapting articulated mobile robots to rugged
terrain: active control of the robot's joints, passive joints, and naturally compliant
joints. The first method requires exact sensing of the contact forces between the propulsion
means and the ground. Based on these measurements in each segment, the control algorithm
has to adapt the joint positions accordingly.

In the case of the robot Koryu, impedance control is proposed (Hirose & Morisima, 1993). It
was shown that, based on force sensing in both the vertical and horizontal directions, the
robot can follow the curvature of the ground and surmount higher, vertical obstacles.

For motion control of the robot Makro (branching into pipes, overcoming obstacles and
driving at the bottom of the sewer), a combination of 2D inclinometers (or 2-axis
accelerometers) with joint angle sensing was proposed.

The active connection mechanism of Souryu introduces special elasticity. Rolling deformation
is generated by the vehicle's own dead weight and does not interfere with the proper operation
of the other DOF in the joint, thanks to a combination of soft and hard springs. This elasticity
also absorbs impacts. Pitch and yaw are also secured from impacts by additional springs.

In the robot Kohga, full 3-DOF passive joints are used in the middle part of the robot: the 4
central segments are interconnected by a combination of a freely rotating universal joint and
an additional roll joint. This design allows the robot to adjust its shape to
irregular surfaces, so that the driving force can be transmitted to the ground more
effectively. The units which are connected with a passive joint are controlled by driving the
right and left crawlers independently instead of manipulating the joint.

Pneumatically driven joints of the Moira and Omnis robots employ the natural compliance of
pneumatic springs. The advantage of using pneumatic springs instead of the earlier-mentioned
solutions is the possibility of controlling their stiffness actively. Regulating the level of
pressure in the chambers of the cylinders (driving Moira's joints) or in each of the 4 bellows
constituting the Integrated Joint Actuator of OmniTread (or OT-4) changes the spatial
compliance of the joint. Therefore, these robots can conform to rock beds when the joints go
limp, and a second later they can span gaps after pumping the pressure up. Of course, the
drawback of this solution is the need for an additional (pneumatic) source of power, which
increases weight and noise.

We decided to combine active and stiff joints with a passive suspension system in each
segment and soft tires. The driving module (described earlier) is mounted in the base of the
Wheeeler segment in such a way that it can rotate ±10° about the longitudinal axis
of the robot. This angle is measured with a rotary potentiometer. The axis of the wheels is
supported by two ball bearings in the funnel, which at its ends is connected with the body of
the segment using 4 springs (Granosik et al., 2008).

We are very pleased with the behavior of Wheeeler's passive suspension, which helps it to
travel over uneven terrain, as shown in Fig. 15, preserving continuous contact with the ground
for all wheels of the robot. Even for obstacles as high as half of the wheel's diameter, the
springs allow each segment to conform to the ground and provide good grip for all tires.

In Fig. 16 we can see the behavior of the springs depending on the position of the wheel with
respect to the floating platform. If a wheel is lifted up by an obstacle, its springs extend, as
shown in the left part of Fig. 16, while the springs on the wheel which is lower are compressed
(right part).




Fig. 16. Close view of springs in passive suspension of Wheeeler during riding over obstacle 

3.3 Electronics 

In order to control all the sensors envisioned in Wheeeler, and to simplify the mechanical
fit into the segment, we have designed and built a specialized controller based on the
AT90CAN128 (Atmel), as shown in Fig. 17. We have chosen this processor for its fast AVR
structure and its relatively high processing power for an 8-bit controller. Additionally,
in-system programming from Atmel allows each processor of the system to be reprogrammed
directly through the CAN bus. This will simplify the development of the robot's lowest-level
software. Local controllers are augmented with all necessary peripherals: the 3-axis
accelerometer LIS3LV02DL (STMicroelectronics), the single-axis gyroscope ADIS16100 (Analog
Devices), the quadrature counter LS7366R (LSI/CSI) and the IR distance sensors GP2D120 (Sharp).
The functionality of the controller can be further extended through serial communication
interfaces: CAN, SPI, I2C and RS232. The CAN bus is used as the main communication means
for data acquisition and robot control.





Fig. 17. Local controller mounted on each segment of Wheeeler 



3.4 Controller 

Local controllers are daisy-chained along the robot's body and connected to the main
controller, realized on a PC104 computer, as shown in Fig. 18. This main controller gathers
data from the robot and forwards it to the operator's station via a wireless link. It will also
be used to transfer the video signal. In the opposite direction, control orders come from the
operator, are analyzed in the main controller and are distributed to the local ones. We are
planning for the main controller to be responsible also for the basic autonomous behavior of
Wheeeler.

At first, basic teleoperation with only visual feedback was introduced. Communication
was unidirectional, allowing the client (operator) to send one of the following instructions:

• new angular velocity of the axle of specified segment, 

• new position of the horizontal or vertical joint of a segment, 

• stop all segments. 
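
The three instruction types listed above could be represented on the operator side as small
message objects. The sketch below is only an illustration: the class names, field names,
units and the 0-based segment index are assumptions, and it does not reproduce the CORBA
interface actually used in the project.

```python
from dataclasses import dataclass
from typing import Union


@dataclass(frozen=True)
class SetAxleVelocity:
    """New angular velocity of the axle of one segment (hypothetical message)."""
    segment: int             # 0-based segment index (assumed convention)
    angular_velocity: float  # rad/s (assumed unit)


@dataclass(frozen=True)
class SetJointPosition:
    """New position of the horizontal or vertical joint of one segment."""
    segment: int
    axis: str     # "horizontal" or "vertical"
    angle: float  # degrees (assumed unit)

    def __post_init__(self) -> None:
        # Reject anything other than the two joint axes named in the text.
        if self.axis not in ("horizontal", "vertical"):
            raise ValueError(f"unknown joint axis: {self.axis!r}")


@dataclass(frozen=True)
class StopAll:
    """Stop all segments."""


Command = Union[SetAxleVelocity, SetJointPosition, StopAll]


def describe(cmd: Command) -> str:
    """Return a human-readable form of a command, e.g. for an operator log."""
    if isinstance(cmd, SetAxleVelocity):
        return f"segment {cmd.segment}: axle velocity {cmd.angular_velocity} rad/s"
    if isinstance(cmd, SetJointPosition):
        return f"segment {cmd.segment}: {cmd.axis} joint to {cmd.angle} deg"
    return "stop all segments"
```

Addressing a single segment in every message makes the inconvenience of this first
teleoperation mode visible: a 7-segment robot needs many such messages for one manoeuvre.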

This form of control would be very inconvenient in a real application; therefore a simple
propagation algorithm for the angular position of the joints was introduced. This algorithm is
usually referred to as the follow-the-leader approach and is most often used to control
serpentine and snake-like robots (Choset & Henning, 1999). However, this method requires the
very strong assumption that we know the exact value of the robot's speed with reference to the
ground. Unfortunately, in most cases where hypermobile robots are intended to operate this
condition is not fulfilled, due to slippage, skidding on rocks or hitting obstacles. To improve
the operation of a hypermobile robot among obstacles we are combining accelerometer readings
with a potential field method based on measurements of the distance from each segment to
the nearest obstacle. Using accelerometers we can detect wheel slippage and correct the
velocity accordingly. Using IR sensors we can check the surroundings of the robot in four
directions (up, down, left and right) and correct the robot's trajectory according to these
measurements.
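
The propagation idea behind follow-the-leader control can be sketched in a few lines. This is
a minimal illustration under stated assumptions, not the controller used on Wheeeler: the
function name and parameters are hypothetical, motion is forward only, and the travelled
distance is taken as known exactly, which is precisely the assumption the text calls very
strong.

```python
from typing import List, Tuple


def follow_the_leader(
    joint_angles: List[float],
    head_command: float,
    travelled: float,
    step: float,
    segment_length: float,
) -> Tuple[List[float], float]:
    """Advance the robot by `step` and propagate joint angles toward the tail.

    joint_angles[0] is the joint nearest the head. Each time the robot has
    moved one full segment length, every joint takes over the angle of the
    joint in front of it and the first joint takes the operator's command.
    Returns the new joint angles and the distance left over toward the next shift.
    """
    if segment_length <= 0:
        raise ValueError("segment_length must be positive")
    angles = list(joint_angles)
    travelled += step
    while travelled >= segment_length:
        angles = [head_command] + angles[:-1]  # shift one joint toward the tail
        travelled -= segment_length
    return angles, travelled


# Usage: three joints, segment length 1.0 (arbitrary units).
angles, dist = follow_the_leader([0.0, 0.0, 0.0], 10.0, 0.0, 0.5, 1.0)
angles, dist = follow_the_leader(angles, 10.0, dist, 0.5, 1.0)
angles, dist = follow_the_leader(angles, 20.0, dist, 1.0, 1.0)
```

If the wheels slip, `step` overestimates the real displacement and the turn is propagated
too early, which is why the text corrects the velocity with accelerometer readings.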

Fig. 18. Wheeeler control structure

3.5 Tests 

After building the proof-of-concept version of Wheeeler, consisting of 3 segments, we
carried out some preliminary tests to verify the robot's behaviour, power consumption and
performance. The results are presented in the following tables. Table 2 compares the speed of
the robot on flat terrain (carpet) and the supply current measured for three levels of supply
voltage. Table 3 shows the current consumption while driving on an inclined flat steel
surface (measurements for two inclinations and three voltage levels).



Supply voltage [V]   Speed of robot [cm/s]   Starting current [A]   Nominal current [A]
5                    32.2                    1.7                    0.7
6                    40.0                    1.7                    0.7
7                    46.0                    2.3                    0.9

Table 2. Performance of 3 segment Wheeeler on flat terrain
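
As a rough worked example, the figures in Table 2 can be turned into electrical input power
and energy per metre travelled. This is only a sketch: it assumes a DC supply, so that power
is simply supply voltage times nominal current, and the function names are illustrative.

```python
# Rows of Table 2:
# (supply voltage [V], speed [cm/s], starting current [A], nominal current [A])
TABLE_2 = [
    (5.0, 32.2, 1.7, 0.7),
    (6.0, 40.0, 1.7, 0.7),
    (7.0, 46.0, 2.3, 0.9),
]


def nominal_power(voltage: float, nominal_current: float) -> float:
    """Electrical input power in watts, assuming a DC supply (P = V * I)."""
    return voltage * nominal_current


def energy_per_metre(voltage: float, speed_cm_s: float, nominal_current: float) -> float:
    """Electrical energy drawn per metre travelled, in joules per metre."""
    speed_m_s = speed_cm_s / 100.0
    return nominal_power(voltage, nominal_current) / speed_m_s


for voltage, speed, _starting, nominal in TABLE_2:
    print(f"{voltage:.0f} V: {nominal_power(voltage, nominal):.1f} W, "
          f"{energy_per_metre(voltage, speed, nominal):.1f} J/m")
```

Under this assumption the nominal input power is about 3.5 W, 4.2 W and 6.3 W at 5 V, 6 V and
7 V respectively.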



Inclination [deg]   Supply voltage [V]   Starting current [A],   Nominal current [A],
                                         going up (down)         going up (down)
16.2                5                    2.6 (1.5)               1.4 (0.3)
                    6                    2.7 (1.6)               1.5 (0.3)
                    7                    2.8 (1.6)               1.7 (0.5)
21.3                5                    2.6                     2.0 (0.2)
                    6                    2.7                     2.0 (0.3)
                    7                    2.8                     2.0 (0.4)

Table 3. Power consumption of 3 segment Wheeeler on inclined surface



4. Conclusion 

We have made an extended literature review in order to analyze the methodologies used in
designing and building hypermobile robots. We have also included our own experience in
this field, coming from the Omnis project and the recent Wheeeler development. The most
important information on each hypermobile robot is summarized in Table 4.

1. Introduction

Finally, the paper deals with the solution of the following problem:

Let signals $y_1(iT), \ldots, y_q(iT)$ (elements of output vector $y(iT)$) be the responses of a
MIMO discrete-time system to the input signals $u_1(iT), \ldots, u_q(iT)$ (elements of input
vector $u(iT)$). Determine the discrete-time excitations $u_{A1}, \ldots, u_{Aq}$ (elements of
vector $u_A$) scaling the system responses to the required forms
$y_1(iA^{-1}T), \ldots, y_q(iA^{-1}T)$, where $A$ is a positive number called the "time scale
coefficient", $T$ is the reference sample time and $i$ is the sample number.

If $A > 1$, then the system responses are speeded up compared with the reference responses
$y_1(iT), \ldots, y_q(iT)$. Putting $0 < A < 1$ one slows the responses down. Perfect scaling of
the system means that the moments of appearance of the consecutive amplitudes belonging to the
reference responses $y_1(iT), \ldots, y_q(iT)$ are "controlled" according to the value of $A$.
The scaling of system responses yields transparent representations in the time and frequency
domains. Thus, time-scaling seems to be a useful tool for speeding the system responses up, or
slowing them down, if the forms of the reference responses satisfy technical needs. Let us
note that perfect scaling of a stable system conserves its stability.

The methods presented in the current Chapter can be applied to time-scaling of the responses
of continuous-time systems on the basis of their respective discrete-time models. Thus, the
considered methods can be treated as alternatives to those for time-scaling of
continuous-time systems (Durnas & Grzywacz, 2001; Durnas & Grzywacz, 2002; Grzywacz,
2006). Of course, there are other applications of time-scaling techniques, such as those
presented in (Moya et al., 2002; Respondek & Pogromsky, 2004), where the
problem of synthesis of controllers and observers for certain classes of non-linear plants has
been solved. In the current Chapter we do not deal with time-scaling in this sense.

2. The idea of method for SISO systems 

2.1 Slowing the system response down 

Let us consider the class of SISO nonlinear, discrete-time systems of order n defined for
sample time T by the system of state equations:

$$
\begin{aligned}
x_1(i+1) &= x_2(i) \\
x_2(i+1) &= x_3(i) \\
&\;\;\vdots \\
x_n(i+1) &= f[x_1(i), x_2(i), \ldots, x_n(i)] + u(i) \\
y(i) &= F[b_0 x_1(i) + b_1 x_2(i) + \ldots + b_{n-1} x_n(i)]
\end{aligned}
\tag{1}
$$

where $x_1(i), x_2(i), \ldots, x_n(i)$ are the components of state vector $x(i)$, $f$ and $F$
represent static nonlinear functions, $u(i)$ is the system input, $y(i)$ is the system output
and $i$ is the sample number. In the particular case where
$F[.] = b_0 x_1(i) + b_1 x_2(i) + \ldots + b_{n-1} x_n(i)$ and
$f(.) = -(a_0 x_1(i) + a_1 x_2(i) + \ldots + a_{n-1} x_n(i))$, system (1) becomes a linear one
given by the transfer function:

$$
G(z) = \frac{b_{n-1} z^{n-1} + \ldots + b_1 z + b_0}{z^n + a_{n-1} z^{n-1} + \ldots + a_1 z + a_0}
\tag{2}
$$



The simplified scheme of system (1) is shown in Fig. 1. In order to slow the system response
down $a$ times ($a$ is a positive integer), we can join the element $z^{-k}$, where
$k = a - 1$, to every single unit delay element $z^{-1}$ belonging to the system shown in
Fig. 1 (i.e. each $z^{-1}$ in the "primary" structure is replaced by $z^{-(k+1)}$).
Additionally, if we substitute the reference excitation $u(iT)$ with the new one $u(iT')$,
where $T' = aT$, then both modifications together change the system response to the form
$y(iT')$. This means that the form of the reference response $y(iT)$ is conserved perfectly.
If system (1) represents a real plant, or a model of it used to calculate the plant control
signal, then the "internal" modification of the structure in Fig. 1 cannot be done. Thus, the
time-scaling of the system in Fig. 1 has to be achieved by suitably forming its excitation
$u(i)$, without modifying its internal structure.
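
The effect of the "internal" modification described above can be checked numerically on a
second-order instance of (1). The sketch below is an illustration only: it replaces every unit
delay by a chain of `a` delays and stretches the input accordingly, it does not implement the
INPUT and FEEDBACK correctors, and the functions `f` and `out` are arbitrary stand-ins chosen
for this example rather than taken from the chapter.

```python
import math
from typing import Callable, List


def simulate(
    f: Callable[[float, float], float],
    out: Callable[[float, float], float],
    u: List[float],
    a: int = 1,
) -> List[float]:
    """Simulate x1(i+a) = x2(i), x2(i+a) = f(x1(i), x2(i)) + u(i), y(i) = out(x1(i), x2(i)).

    With a = 1 this is the primary second-order system; with a > 1 every unit
    delay has been replaced by a chain of `a` delays. Initial state is zero.
    """
    n = len(u)
    x1 = [0.0] * (n + a)
    x2 = [0.0] * (n + a)
    y = []
    for i in range(n):
        y.append(out(x1[i], x2[i]))
        x1[i + a] = x2[i]
        x2[i + a] = f(x1[i], x2[i]) + u[i]
    return y


def f(x1: float, x2: float) -> float:
    return -0.72 * x1 + 0.5 * math.tanh(x2)  # illustrative nonlinearity (assumed)


def out(x1: float, x2: float) -> float:
    return abs(2.0 * x1 + 5.0 * x2)          # illustrative output map (assumed)


N, a = 20, 2
u_ref = [1.0] * N                                # reference step excitation u(iT)
y_ref = simulate(f, out, u_ref)                  # reference response y(iT)
u_slow = [u_ref[i // a] for i in range(a * N)]   # the same amplitudes, each held a samples
y_slow = simulate(f, out, u_slow, a)             # response of the modified structure
```

Every amplitude of `y_ref` reappears in `y_slow`, held for `a` samples, so the form of the
reference response is conserved while it unfolds `a` times more slowly.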



Fig. 1. The simplified scheme of system (1): $z^{-1}$ - chain of unit delay elements, x(i) - state
vector with components representing the outputs of the single delay elements, F and f - static
nonlinear functions, u - input, y - output.

To solve the scaling task one can try to reconfigure the modified structure of Fig. 1 (the one
containing elements $z^{-(k+1)}$ instead of $z^{-1}$) into the structure shown in Fig. 2, where
SYSTEM represents the "primary" structure shown in Fig. 1 (before any modification).
The relevant calculations allow the operation realized by the "INPUT" corrector to be
expressed in the form:



Time-Scaling of SISO and MIMO Discrete-Time Systems 






$$
G_z(z) = \frac{b_{n-1} z^{nk+n-k-1} + b_{n-2} z^{nk+n-2k-2} + \ldots + b_1 z^{k+1} + b_0}
{b_0 z^{nk} + b_1 z^{nk+1} + \ldots + b_{n-2} z^{nk+n-2} + b_{n-1} z^{nk+n-1}}
\tag{3}
$$



The output signal of the "FEEDBACK" corrector is formed as follows: the components of state
vector $x(i)$ are passed through the transfer function $(G_z(z))^{-1}$. This creates the
auxiliary signals $x'_1(z) = (G_z(z))^{-1} x_1(z), \ldots, x'_n(z) = (G_z(z))^{-1} x_n(z)$. Then,
on the basis of signals $x'_1(i), \ldots, x'_n(i)$, one can create signal $u_f(i)$ according to
the formula:

$$
u_f(i) = f[x'_1(i - nk),\; x'_2(i - (n-1)k),\; \ldots,\; x'_n(i - k)]
\tag{4}
$$

Note that operation (4) processes the delayed signals $x'_1(i), \ldots, x'_n(i)$. Next, signal
$u_f(i)$ has to be passed through the transmittance $G_z(z)$. Its output signal is
$u_{fG}(i)$. Finally, the output signal $U_f(i)$ of the "FEEDBACK" corrector, which "fits" the
scheme shown in Fig. 2, is composed using the expression:

$$
U_f(i) = u_{fG}(i) - f[x_1(i), x_2(i), \ldots, x_n(i)]
\tag{5}
$$

The input signal of the structure shown in Fig. 2 ought to be $u(iT')$, which means that one
has to repeat the form of the reference input signal of the system shown in Fig. 1 (i.e.
$u(iT)$ for $A^{-1} = a = 1$); however, the consecutive amplitudes of signal $u(iT')$ are
changed now at moments (iT). Let us note that both systems (those in Fig. 1 and Fig. 2)
operate with the same sample time T. If the order of the denominator of $G_z(z)$ is higher
than the order of the numerator, then we can use the function $G'_z(z) = z^d G_z(z)$ instead
of $G_z(z)$, where $d$ represents the difference between the denominator order and the
numerator order. This "deviation" does not affect the form of the output signal. It is
merely shifted forwards by $d$ samples T.



Fig. 2. The time-scaling of linear and nonlinear SISO systems: $u(iA^{-1}T)$ - scaled reference
excitation, $u_A$ - system input scaling its response, $y(iA^{-1}T)$ - scaled reference response
y(iT), x(iT) - system state vector.

Example A. Let us consider the second order system (6) belonging to class (1):

$$
\begin{aligned}
x_1(i+1) &= x_2(i) \\
x_2(i+1) &= f[x_1(i), x_2(i)] + u(i) \\
y(i) &= F[b_0 x_1(i) + b_1 x_2(i)]
\end{aligned}
\tag{6}
$$

where $f(.) = -0.72\,x_1(i) + 0.5\,(x_2(i))^3$ and $F(.) = |2 x_1(i) + 5 x_2(i)|$.

The response of system (6) to signal u(i) = 1(i) for T = 1 s is shown in Fig. 3. Let us assume
that the form of that response fulfils the technical needs, but that it is "too speedy" and
should be slowed down 2 times. Hence $A = 0.5$, $n = 2$, $a = 2$, $k = 1$, $b_0 = 2$, $b_1 = 5$.
Putting $k = 1$, $n = 2$ into (3) one obtains the INPUT corrector (see Fig. 2) in the form:

$$
G_z(z) = \frac{5z^2 + 2}{5z^3 + 2z^2}
\tag{7}
$$
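
The substitution into (3) can be checked mechanically. The sketch below reads (3) as a
numerator with terms $b_j z^{j(k+1)}$ and a denominator with terms $b_j z^{nk+j}$ for
$j = 0, \ldots, n-1$; the function names and the dictionary representation of polynomials are
choices made for this illustration only.

```python
from typing import Dict, List, Tuple

Poly = Dict[int, float]  # maps exponent of z to its coefficient


def input_corrector_slow(b: List[float], k: int) -> Tuple[Poly, Poly]:
    """Numerator and denominator of G_z(z) as read from (3).

    b = [b_0, ..., b_{n-1}] are the output coefficients of system (1) and
    k = a - 1 is the number of delays added to every unit delay.
    """
    n = len(b)
    numerator = {j * (k + 1): b[j] for j in range(n)}
    denominator = {n * k + j: b[j] for j in range(n)}
    return numerator, denominator


def relative_degree(numerator: Poly, denominator: Poly) -> int:
    """Denominator order minus numerator order (the d used in z^d G_z(z))."""
    return max(denominator) - max(numerator)


# Example A: b_0 = 2, b_1 = 5, k = 1.
num, den = input_corrector_slow([2, 5], 1)
d = relative_degree(num, den)
```

For Example A this gives the numerator $5z^2 + 2$, the denominator $5z^3 + 2z^2$ and $d = 1$,
in agreement with (7).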



To design the FEEDBACK corrector we have to use $(G_z(z))^{-1}$. Inversion of $G_z(z)$ yields
a transfer function with the order of the numerator higher than the order of the denominator.
However, this is only an apparent difficulty. Instead of $G_z(z)$ we can put the invertible
transfer function $G_z'(z) = z\,G_z(z)$, with identical orders of numerator and denominator,
into the FEEDBACK and INPUT correctors. This "deviation" does not affect the form of the
output signal. It is merely shifted forward by one sample period T.

Fig. 3. Left position: the reference response of system (6) to signal u(i) = 1(i) (curve
distinguished by ".") and the response of system (6) obtained as the output signal of the
structure in Fig. 2 (curve distinguished by "o"). Note that consecutive amplitudes of both
responses are identical (compare the chosen corresponding amplitudes indicated by arrows).
Right position: reference input signal u(i) = 1(i) and input signal $u_A(i)$ of system (6) when
included in the structure in Fig. 2 as SYSTEM.

The FEEDBACK corrector (see Fig. 2) for system (6) and the assumed values of A, k, n can be
defined by means of $G_z'(z) = z\,G_z(z)$ and the nonlinear functions

$$
\begin{aligned}
u_f(i) &= f[x'_1(i-2), x'_2(i-1)] = -0.72\,x'_1(i-2) + 0.5\,(x'_2(i-1))^3 \\
U_f(i) &= u_{fG}(i) - f[x_1(i), x_2(i)]
\end{aligned}
\tag{8}
$$

where signal $u_{fG}(i)$ is generated by passing signal $u_f(i)$ through the transfer function
$G_z'(z)$. The simulations carried out for a structure like that in Fig. 2 with the data of the
current example yield the results shown in Fig. 3. We can observe that the algorithm described
above allows the INPUT and FEEDBACK correctors to be designed so that they guarantee a
perfect result of scaling.

2.2 Speeding the system response up

Let model (1) be used to calculate the system input signal which speeds its reference
response up $A$ times ($A$ is a positive integer). If the primary model (1) as well as the
system excitations are defined for sample time $T$, then one cannot speed its response up
without shortening the sample time. We can treat the operator $z^{-1}$ associated with sample
time $T$ as $z^{-1} = Z^{-A}$, where $Z^{-1}$ refers to delay $T' = A^{-1}T$. Thus, we can
imagine that each unit delay $z^{-1}$ of the "primary" structure in Fig. 1 is substituted by
$Z^{-A}$. After these transformations of the primary model (1), the modified structure
operates with sample time $T'$, although system model (1) has been defined for sample time
$T$. Now, if we remove $k = A - 1$ elements $Z^{-1}$ from each single "cell"
$z^{-1} = Z^{-A}$, then we obtain a structure like that in Fig. 1, where the operators
$z^{-1}$ are replaced with $Z^{-1}$. If we put the scaled signal $u(iT')$ to the input of that
modified structure (instead of the primary $u(iT)$), then the response of the modified system
will be speeded up $A$ times and the form of the reference output (from before the described
modification) will be conserved. However, when dealing with discrete-time models of real
plants used to calculate signals for control purposes, we cannot manipulate the "inside" of
the structure modeling the plant, and all manipulations ought to be done "externally" in
relation to the plant model "interior".

To solve the scaling problem by synthesis of a suitable input signal we can try to
reconfigure the modified structure of Fig. 1 (the one containing $Z^{-1}$ instead of
$z^{-1}$) into the structure shown in Fig. 2, where SYSTEM represents the "primary" structure
shown in Fig. 1 from before the modifications described above. As in the case considered in
Section 2.1, the current task can be solved by equivalent transformations of the modified
structure aimed at separating its subsystem which takes the form of the "primary" system,
i.e. that shown in Fig. 1, with the unit delays $z^{-1}$ replaced by a chain of $A$ single
delays $Z^{-1}$. The equivalent transformations are based on the property
$Z^{-1} = z^{-1} Z^{k}$, where $k = A - 1$. The calculations allow the following operation
realized by the "INPUT" corrector to be defined:

$$
G_Z(Z) = \frac{b_0 Z^{nk} Z^{0} + b_1 Z^{(n-1)k} Z^{A} + \ldots + b_{n-1} Z^{k} Z^{A(n-1)}}
{b_0 + b_1 Z^{k+1} + \ldots + b_{n-1} Z^{(n-1)(k+1)}}
\tag{9}
$$

The output signal of the "FEEDBACK" corrector can be generated as follows: the forward-shifted
components of the state vector, those corresponding to sample time $T$, i.e.
$x_1(i+nk), x_2(i+(n-1)k), \ldots, x_n(i+k)$, should be passed through the transfer function
$(G_Z(Z))^{-1}$. This operation yields the signals
$x'_1(i+nk), x'_2(i+(n-1)k), \ldots, x'_n(i+k)$ respectively. Using these auxiliary signals
one can form signal $u_f(i)$:

$$
u_f(i) = f[x'_1(i + nk),\; x'_2(i + (n-1)k),\; \ldots,\; x'_n(i + k)]
\tag{10}
$$

Note that the forward-shifted components of vector $x$ are available, because they are
generated as output signals of the "former" delay elements $Z^{-1}$ creating the chain of
delays used for assembling model (1), or can be generated by delaying the "former" state
variables associated with model (1). Next, signal $u_f(i)$ has to be passed through the
transfer function $G_Z(Z)$. Let the output signal of $G_Z(Z)$ be denoted by $u_{fG}(i)$.
Finally, the output signal $U_f(i)$ of the "FEEDBACK" corrector (see Fig. 2) is composed
using the formula:

$$
U_f(i) = u_{fG}(i) - f[x_1(i), x_2(i), \ldots, x_n(i)]
\tag{11}
$$

The described rules allow the output $y$ to be scaled exactly if SYSTEM "pretends" to operate
with sample time $T$ (because of the substitution $z^{-1} = Z^{-A}$) while its real sample
time is $T' = A^{-1}T$. It is obvious that one cannot speed up the reference output $y(iT)$,
conserving its consecutive amplitudes, if the system sample time is still $T$. However,
depending on $A$, we can remove every second, or every third, etc. amplitude from the
reference sequence $y(iT)$ and generate the consecutive remaining amplitudes with sample time
$T$. This can be treated as a simplified way of speeding the SYSTEM response up (some
amplitudes of the reference response are not repeated) if the sample time $T$ of the primary
SYSTEM cannot be shortened. To obtain this simplified result we can use signal $u_A$ (Fig. 2),
generated exactly as described above for SYSTEM operating with sample time $T'$. However,
signal $u_A$ determined for $T'$ should be passed through a "ZOH" (zero order hold) element
defined for sample time $T$. The output signal of the "ZOH" operation should then be applied
as excitation to the input of the "primary" SYSTEM, i.e. the one operating with sample time
$T$. This means that SYSTEM operating with $T'$ in a structure like that in Fig. 2 should now
be treated as a calculation model of the real primary SYSTEM operating with sample time $T$.
Example B. Let us consider once more the second-order system (6). The reference response
of system (6) to signal u(i) = 1(i) for T = 1 has been shown in Fig. 3. Let us assume that the form
of the considered response fulfils the technical needs, but it is "too slow" and should be
speeded up 2 times. Hence A=2, n=2, k=1, $b_0$=2, $b_1$=5. Putting k=1, n=2 into (9) one obtains the
INPUT corrector (see Fig. 2) in the form:

$$G_Z(Z) = \frac{5Z^3 + 2Z}{5Z^2 + 2} \qquad (12)$$

To avoid processing a transfer function whose numerator order is higher than its
denominator order, we can use the transfer function $G_Z'(Z) = [Z^{-1} G_Z(Z)]$ with identical orders of
numerator and denominator. This "deviation" does not affect the form of the time-scaled output
signal. It is merely shifted backwards by one sample period T'. The FEEDBACK corrector (Fig.
2) for system (6) and the assumed values A, k, n can be defined by means of $G_Z'(Z) = [Z^{-1} G_Z(Z)]$
and nonlinear functions:

Uf(i)=f[x'i(i +2),x' 2 (i+l)]= - 0.72 x'i(i +2) + 0.5(x' 2 (i+l)P 



$$U_f(i) = u_{fG}(i) - f[x_1(i), x_2(i)] \qquad (13)$$

where signal $u_{fG}(i)$ is generated by passing the signal $u_f(i)$ through the transfer function
$G_Z'(Z)$. The scheme of the system for speeding up the response of system (6) is shown in Fig. 4.
Exemplary simulation results are shown in Fig. 5.

The result of simplified scaling of system (6) is shown in Fig. 5 as well (see lowest position).
This result has been obtained by passing signal $u_A$, generated in the system shown in Fig. 4,
through the "ZOH" operation defined for the primary sample time T. After that, the output of the
"ZOH" operation has been used as excitation of the primary SYSTEM (6), i.e. the one defined for
sample time T.
Fig. 4. The scaling of system (6) according to Fig. 2 and Example B. Note that
primary delays T (for $z^{-1}$) are substituted with two delays T' = 0.5 T, where T' refers to
operator $Z^{-1}$.



2.3 Generalization 

The considerations in the previous Sections have been carried out for A being a fraction with
numerator equal to one (for slowing the response down) or A being an integer (for
speeding the response up). Identical reasoning can be repeated if each operator of
primary unit delay $z^{-1}$ in the system shown in Fig. 1 is substituted with a chain of $\alpha$ operators
$Z^{-1}$ associated with delay $T' = T/\alpha$, and next each single unit delay $z^{-1} = Z^{-\alpha}$ is substituted with
$(Z^{-\alpha} Z^{-\beta})$ for slowing the response or with $(Z^{-\alpha} Z^{\beta})$, where $\beta < \alpha$, for speeding the
response. All further transformations aimed at separation of the primary system structure (from
before scaling) can be done exactly as described in the previous Sections. Thus, this is an
easy way of scaling if one needs to slow the response down $[(\alpha+\beta)/\alpha]$ times or speed it up
$[\alpha/(\alpha-\beta)]$ times. Of course, the obtained final structure (like in Fig. 2) has to be supplied with
the scaled reference excitation u.
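As a small worked illustration of the two ratios quoted above, the following sketch computes the slow-down and speed-up factors for integer chain lengths. The function name and the example values of alpha and beta are assumptions made only for this illustration.

```python
from fractions import Fraction


def scaling_factors(alpha, beta):
    """Return (slow-down factor, speed-up factor) for chain lengths alpha, beta.

    Replacing Z^-alpha by Z^-(alpha+beta) stretches the response (alpha+beta)/alpha times;
    replacing it by Z^-(alpha-beta) compresses it alpha/(alpha-beta) times.
    """
    if not 0 < beta < alpha:
        raise ValueError("the speed-up case requires 0 < beta < alpha")
    slow = Fraction(alpha + beta, alpha)
    fast = Fraction(alpha, alpha - beta)
    return slow, fast


slow, fast = scaling_factors(alpha=3, beta=1)
print(slow, fast)  # 4/3 3/2
```

Rational factors such as 4/3 or 3/2 are not reachable with the integer or unit-fraction values of A used in the earlier Sections.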

Looking for other possibilities of scaling discrete-time systems, we can adapt the algorithms
for scaling continuous-time systems (Durnas & Grzywacz, 2001; Durnas & Grzywacz,
2002; Grzywacz, 2006). System (1) can be, or can be treated as, the discrete-time
model of a certain continuous-time system. So, one can determine the continuous-time
model associated with system (1). The obtained model can then be scaled like a continuous-time
system. Finally, the respective scheme for scaling the continuous-time model can be
transformed back into a discrete-time scheme. It must be admitted that scaling via the
continuous-time representation does not guarantee the same consecutive amplitudes of the
discrete-time output signals for a variety of values of A. Nevertheless, we can treat this approach
as the determination of an approximate solution.
Fig. 5. Upper figure: the reference response of system (6) to signal u(i)=1(i) (curve
marked by ".") and the speeded response of system (6) obtained as output signal of the
structure in Fig. 4 (curve marked by "o"). Note that consecutive amplitudes of both
responses are identical (compare chosen referring amplitudes indicated by arrows). Below:
reference input u(i)=1(i) and input signal $u_A(i)$ of system (6) when included in the structure in
Fig. 4 (or in Fig. 2) as SYSTEM. The lowest position: the speeded response of system (6)
obtained as output of the structure in Fig. 4 (curve marked by "o") and the simplified result
of scaling with sample time T (heavy line). Note that every second amplitude of the perfectly
scaled output y(iT') is equal to the respective amplitude of the simplified result of scaling for
sample time T.



3. Time-scaling of MIMO systems 

Let us assume that a MIMO system can be modelled by the "V" structure (Chen, 1983) shown in
Fig. 6, where $V_{mj}$ are linear or nonlinear operations processing the input or output signals
respectively. To scale the "V" structure one has to replace its SISO elements $V_{mj}$ (Fig. 6) by the
respective SISO structures shown in Fig. 2 (where $V_{mj}$ has to be treated as "SYSTEM", see
Fig. 2, and elements $V_{Imj}$, $V_{Fmj}$ represent the operations of the associated INPUT corrector and
FEEDBACK corrector respectively). After this modification of the primary "V" structure one
can arrange a set of equivalent scheme transformations aimed at separation of the primary
"V" structure. This goal can be achieved, and signals $u_{A1}$, $u_{A2}$ scaling the outputs $y_1$, $y_2$ can
be generated, by equipping the system with external correction loops. This means that the internal
structure of the system is not affected. If the reference signals $u_1(iT)$, $u_2(iT)$ exciting the system
(see Fig. 6, Fig. 7) are substituted with signals $u_{A1}$, $u_{A2}$:

$$u_{A1} = V_{I11}\bigl(u_1(iA^{-1}T) + V_{A21}(y_2)\bigr) + V_{F11}(x_{11}, y_1) - V_{21}(y_2)$$

$$u_{A2} = V_{I22}\bigl(u_2(iA^{-1}T) + V_{A12}(y_1)\bigr) + V_{F22}(x_{22}, y_2) - V_{12}(y_1) \qquad (14)$$



Fig. 6. The exemplary V-structure for 2 inputs and 2 outputs.

then the system responses to these signals will be $y_1(iT')$ and $y_2(iT')$, instead of $y_1(iT)$ and $y_2(iT)$
from before correction. Thus, the system output signals are scaled perfectly. The operations
realised by the INPUT correctors associated with elements $V_{11}$ and $V_{22}$ (see Fig. 2) have been
denoted in (14) by $V_{I11}$, $V_{I22}$. The operations realised by the FEEDBACK correctors associated
with $V_{11}$ and $V_{22}$ have been denoted by $V_{F11}$, $V_{F22}$. The operations responsible for interaction
(Fig. 6) are denoted by $V_{21}$, $V_{12}$. The time-scaled interaction operations $V_{21}$ and $V_{12}$ are
denoted by $V_{A21}$ and $V_{A12}$ (if, for example, $r_1(iT) = V_{21}(y_2(iT))$, then $r_1(iT') = V_{A21}(y_2(iT'))$).
Finally, the state vectors chosen for $V_{11}$ and $V_{22}$ are denoted by $x_{11}(i)$, $x_{22}(i)$. If necessary, one
can generalise (14) to the form covering the required number of inputs and outputs:



$$u_{Am} = V_{Imm}\Bigl(u_m(iA^{-1}T) + \sum_{j=1,\, j \neq m}^{q} V_{Ajm}(y_j)\Bigr) + V_{Fmm}(x_{mm}, y_m) - \sum_{j=1,\, j \neq m}^{q} V_{jm}(y_j), \quad m = 1, \ldots, q \qquad (15)$$
The generalization (15) is applicable if the SISO components of the "V" model of the MIMO system can
be scaled in the structure shown in Fig. 2. This means that the presented rules of scaling for SISO
systems can be useful for scaling MIMO systems.
Fig. 7. The synthesis of signal $u_{A2}$ for scaling of SYSTEM output signal $y_2$. To keep the
scheme legible, the sub-structure generating signal $u_{A1}$ has not been attached. Note
that the effect of time-scaling has been achieved without modification of the internal structure of
SYSTEM.

Example C. For simplicity, let us consider the linear, continuous-time MIMO system where

$$V_{11}(s) = [6s^2 + s + 1]^{-1}, \quad V_{22}(s) = [12s^2 + 4s + 1]^{-1},$$

$$V_{21}(s) = -10s^2, \quad V_{12}(s) = -5s^2. \qquad (16)$$

Let us assume that the forms of the system responses $y_1(t)$, $y_2(t)$ to excitations $u_1(t)$, $u_2(t)$ fulfil
technical needs and ought to be conserved. However, they are too "fast" and $y_1(t)$, $y_2(t)$
have to be slowed down 2 times (A=0.5). Moreover, let us assume that one cannot change
the system parameters. The solution of this task can be found using the idea of time-scaling,
for example, by processing the discrete-time V-model of the system. Using Tustin's transform
one obtains:



$$V_{11}(z) = 0.0833\,T^2 (z^2 + 10z + 1)(M_{11}(z))^{-1}, \quad V_{22}(z) = 0.0833\,T^2 (z^2 + 10z + 1)(M_{22}(z))^{-1},$$

$$V_{21}(z) = -120\,T^{-2}(z^2 - 2z + 1)(z^2 + 10z + 1)^{-1}, \quad V_{12}(z) = -60\,T^{-2}(z^2 - 2z + 1)(z^2 + 10z + 1)^{-1}, \qquad (17)$$

where:

$$M_{11}(z) = (6 + 0.5T + 0.0833T^2) z^2 + (0.8333T^2 - 12) z + 0.0833T^2 - 0.5T - 6,$$

$$M_{22}(z) = (12 + 2T + 0.0833T^2) z^2 + (0.8333T^2 - 24) z + 0.0833T^2 - 2T - 12.$$
The responses of the discrete model of the system for sample time T = 1 s are shown in Fig. 8. The
excitations of the system have been obtained using (14). The elements $V_{mj}(z)$ of the structure in Fig.
6 have been scaled according to the rules described in Section 2.1.
Fig. 8. Result of scaling of outputs $y_1(t)$, $y_2(t)$ (solid lines: responses of the system to
excitations $u_{A1}$, $u_{A2}$; staircase lines: responses of the discrete-time model of the system) and
discrete-time signals $u_{A1}$, $u_{A2}$ supplying the system.



4. Conclusions 

1. The scaling of SISO systems is realised by equipping them with INPUT and FEEDBACK
correctors (Fig. 2). As a result, external loops process the system output signal as well as the
system state variables, and the internal structure of the system is not affected.

2. The scaling of a MIMO system is based on its "V" model and can be implemented
if the linear and nonlinear SISO elements of this model can be scaled using the SISO structure
shown in Fig. 2.

3. The presented algorithms can be successfully applied to discrete-time models of
continuous-time systems (see Example C). Thus, the discrete-time algorithms can prove
helpful for time-scaling of responses of continuous-time systems.

4. The perfect scaling of a stable system conserves its stability. Furthermore, the frequency
representations of signals and systems after scaling can be immediately determined on the
basis of the reference representations (from before scaling).

5. A considerable number of parameters representing the system and associated signals in the
time domain and frequency domain can be treated as invariant while the system dynamic
properties are changed according to the rules of time scaling. This advantageous feature makes it
possible to design advanced control algorithms which conserve the crucial parameters
characterising the control process (overshoots, steady-state errors, stability margins, etc.) if
plant responses have to be slowed down or speeded up. The control algorithms using the idea
of time-scaling process the plant state variables. That is why the well-known MFC structure
can be advised for implementation purposes (Durnas & Grzywacz, 2001; Durnas &
Grzywacz, 2002; Grzywacz, 2006).

6. An attractive area of application of the idea of time-scaling seems to be
algorithms for the control of robot drives. If displacements along the x, y, z axes are scaled with
the same A, then the forms of the space trajectories of robot movement for various A are identical
(they do not depend on the speed of robot movement).
1. Introduction



Linear time-varying (LTV) models have a niche of their own in the representation of systems 
whose behaviour may be fairly approximated as linear but whose parameters are not 
constant or when their behaviour shows some other properties not displayed by linear time- 
invariant (LTI) models. LTV models have been applied successfully in a wide range of 
problems ranging from system modelling and control (see, for instance, Benton & Smith, 
2005; Kim et al., 2005; Vaishya & Singh, 2001) or electronic circuit design (Darabi et al., 2004) 
to less common applications such as the modelling of soil carbon dynamics (Martin et al., 
2007) or the analysis of the stability of oscillatory (bio)chemical systems (Zak et al., 2005). 
A common problem found in systems science and which is relevant for automation 
techniques and robotics is the proper identification of the parameters present in a given 
system. For this kind of problems, methods have been derived for the identification of LTV 
dynamics (see, for instance, Lortie & Kearney, 2001). However, in particular applications 
where it is required to estimate in real time the parameters of a given plant with LTV 
behaviour and there is limited computing power for this purpose, it would be desirable to 
resort to another less expensive scheme. 

A traditional scheme used to identify certain parameters of a given system is depicted in 
Figure 1. In this scheme, the output of the system under investigation x is compared to the 
output of the adaptive filter y and the difference e is used to tune the filter. Traditionally, the 
filter is based on an algorithm implemented on a suitable computing platform and adequate 
interfacing circuitry. However, nothing precludes the implementation of such a filter using 
analogue techniques which may lead to the synthesis of systems for analogue computing 
(Cowan et al., 2006). Nowadays circuit design techniques are advanced enough to synthesise
in a rather convenient way all kinds of static and dynamic linear and nonlinear functions
without relying on implementations comprising several operational amplifiers, resistors,
capacitors and multipliers, while working with low power consumption and relatively
high processing speed (Mulder et al., 1998). For instance, a second-order low-pass
filter whose bandwidth depends linearly on the bias current I is depicted in Figure 2
(Frey, 1993). With such techniques it is even possible to perform quite advanced signal
processing functions such as the generation of wavelet bases (Haddad et al., 2005).
Figure 1. Parameter identification using adaptive filter

Figure 2. A second-order low-pass filter implemented with log-domain techniques (Frey,
1993).

In this chapter, a strategy to formulate continuous-time dynamical LTV systems with a 
prescribed dynamical behaviour will be proposed. Without any loss of generality, the 
strategy proposed will be first used to derive the model of a second-order LTV system with 
fully configurable modes. Such a model may be used to implement, for instance, an adaptive 
filter for system identification purposes together with an appropriate tuning scheme or may 
even be used for analogue computing (Cowan et al., 2006). Furthermore, the proposed
model is guaranteed to have a stable behaviour provided that certain system constraints
are observed.

The rest of this chapter is organised as follows: in Section 2, a model for a second-order LTV 
system which may accept fast variations in its parameters without compromising its 
stability will be derived. In Section 3, the stability properties of the proposed model will be 
assessed. The properties of the new model will be compared with previous work done by 
the authors. The behaviour of the new model will be further validated through simulations 
in Section 4. Finally, some concluding remarks will be given in Section 5. 
2. Synthesis of continuous-time LTV models of n-th order

Without any loss of generality, consider the second-order LTV differential equation

$$x''(t) + a_1(t)x'(t) + a_0(t)x(t) = 0 \qquad (1)$$

where x(t) is a scalar function of the time variable t with a prescribed dynamical
behaviour and $a_0(t)$ and $a_1(t)$ are also scalar functions of t which serve as the
time-varying coefficients of (1).

It is desired that the behaviour of x(t) in (1) resembles the behaviour of the scalar
function y(t) arising from the following LTI equation:

$$y''(t) + 2\xi\omega y'(t) + \omega^2 y(t) = 0 \qquad (2)$$

where $\xi$ is the damping parameter and $\omega$ is the undamped natural frequency associated with (2).

The general solution of (2) is given by

$$y(t) = C_1 e^{\omega(-\xi + (\xi^2 - 1)^{1/2})t} + C_2 e^{\omega(-\xi - (\xi^2 - 1)^{1/2})t} \qquad (3)$$

where $C_1$ and $C_2$ are arbitrary constants. The general solution x(t) of (1) should resemble
the behaviour of the solution of (2). However, the parameters $\xi$ and $\omega$ present in (3) must
in general be time-varying.

A possible solution is that x(t) is composed of a linear combination of two modes of the form

$$x_1(t) = e^{\int \omega(t)\left(-\xi(t) + (\xi^2(t) - 1)^{1/2}\right) dt} \qquad (4)$$

$$x_2(t) = e^{\int \omega(t)\left(-\xi(t) - (\xi^2(t) - 1)^{1/2}\right) dt} \qquad (5)$$

The proposed modes for (1) are now defined in terms of time-varying parameters. These
solutions may be used to determine the coefficients $a_1(t)$ and $a_0(t)$ which are required in (1)
to have the specified dynamic behaviour.

In order to obtain the time-varying coefficients of (1) from a set of known solutions, a linear
algebraic system of equations has to be solved. Substitution of (4) and (5) in (1) leads to the
following set of linear algebraic equations which may be solved for $a_1(t)$ and $a_0(t)$

$$\begin{bmatrix} x_1'(t) & x_1(t) \\ x_2'(t) & x_2(t) \end{bmatrix} \begin{bmatrix} a_1(t) \\ a_0(t) \end{bmatrix} = \begin{bmatrix} -x_1''(t) \\ -x_2''(t) \end{bmatrix} \qquad (6)$$
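To make the role of system (6) concrete, the following sketch solves it by Cramer's rule at a single time instant. The helper name and the two test modes $e^{-t}$ and $e^{-2t}$ are assumptions chosen for illustration; they are constant-coefficient modes, so the recovered coefficients should be those of $x'' + 3x' + 2x = 0$.

```python
import math


def coefficients_from_modes(x1, dx1, ddx1, x2, dx2, ddx2):
    """Solve [[x1', x1], [x2', x2]] [a1, a0]^T = [-x1'', -x2'']^T by Cramer's rule."""
    det = dx1 * x2 - x1 * dx2  # determinant of the coefficient matrix
    if det == 0.0:
        raise ZeroDivisionError("modes are linearly dependent at this instant")
    a1 = (-ddx1 * x2 + x1 * ddx2) / det
    a0 = (-dx1 * ddx2 + ddx1 * dx2) / det
    return a1, a0


t = 0.3
e1, e2 = math.exp(-t), math.exp(-2.0 * t)
# Modes x1 = exp(-t), x2 = exp(-2t) with their first and second derivatives.
a1, a0 = coefficients_from_modes(e1, -e1, e1, e2, -2.0 * e2, 4.0 * e2)
print(a1, a0)  # approximately 3 and 2
```

For time-varying modes the same call would simply be repeated at every instant of interest.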



The solution of (6) is guaranteed to exist since $x_1(t)$ and $x_2(t)$ are linearly independent
functions. After solving (6) using (4) and (5), the following expressions for $a_1(t)$ and $a_0(t)$ are
obtained

$$a_1(t) = \xi(t)\left[2\omega(t) - \xi'(t)(\xi^2(t) - 1)^{-1}\right] - \omega'(t)\,\omega^{-1}(t) \qquad (7)$$

$$a_0(t) = \omega(t)\left[\omega(t) - \xi'(t)(\xi^2(t) - 1)^{-1}\right] \qquad (8)$$
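The coefficient expressions can be checked numerically. A mode of the form $x(t) = e^{\int p(t)\,dt}$ satisfies $x' = p\,x$ and $x'' = (p' + p^2)x$, so it solves (1) exactly when $p' + p^2 + a_1 p + a_0 = 0$. The sketch below evaluates this residual with $a_1(t) = \xi[2\omega - \xi'/(\xi^2 - 1)] - \omega'/\omega$ and $a_0(t) = \omega[\omega - \xi'/(\xi^2 - 1)]$. The test functions chosen for $\omega(t)$ and $\xi(t)$ and the finite-difference derivative are assumptions made for this illustration only.

```python
import cmath
import math


def omega(t):
    return 2.0 + math.sin(t)          # arbitrary smooth, positive test function


def xi(t):
    return 0.5 + 0.1 * math.cos(t)    # arbitrary underdamped test function


def deriv(f, t, h=1e-5):
    """Central finite-difference approximation of f'(t)."""
    return (f(t + h) - f(t - h)) / (2.0 * h)


def rate(t, sign):
    """p(t) = omega(t) * (-xi(t) +/- sqrt(xi(t)^2 - 1)), the integrand of modes (4), (5)."""
    return omega(t) * (-xi(t) + sign * cmath.sqrt(xi(t) ** 2 - 1.0))


def a1(t):
    q = deriv(xi, t) / (xi(t) ** 2 - 1.0)
    return xi(t) * (2.0 * omega(t) - q) - deriv(omega, t) / omega(t)


def a0(t):
    q = deriv(xi, t) / (xi(t) ** 2 - 1.0)
    return omega(t) * (omega(t) - q)


def residual(t, sign):
    p = rate(t, sign)
    dp = deriv(lambda s: rate(s, sign), t)
    return abs(dp + p * p + a1(t) * p + a0(t))


worst = max(residual(0.1 * k, sign) for k in range(100) for sign in (+1, -1))
print(worst)  # limited only by finite-difference error
```

A residual at the level of the finite-difference error indicates that both modes solve (1) with these coefficients.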
It should be noticed that other definitions for the modes of (1) could have been proposed
which may resemble the dynamic behaviour of (2) when the time-varying parameters of (1)
reach a constant value. Such a pair of modes would be, for instance

$$x_1(t) = e^{\omega(t)\left(-\xi(t) + (\xi^2(t) - 1)^{1/2}\right)t} \qquad (9)$$

$$x_2(t) = e^{\omega(t)\left(-\xi(t) - (\xi^2(t) - 1)^{1/2}\right)t} \qquad (10)$$

However, such a choice would give a very complicated formulation of the functions $a_1(t)$
and $a_0(t)$ in (1).

In general, the method used for the determination of $a_1(t)$ and $a_0(t)$ for (1) may be applied to
determine the time-varying coefficients $a_i(t)$, i = 0, 1, ..., n-1 of the n-th order LTV differential
equation of the form

$$x^{(n)}(t) + a_{n-1}(t)x^{(n-1)}(t) + a_{n-2}(t)x^{(n-2)}(t) + \ldots + a_1(t)x'(t) + a_0(t)x(t) = 0 \qquad (11)$$

provided that n linearly independent solutions $x_i(t)$, i = 1, 2, ..., n are known. In this case, the
time-varying coefficients $a_i(t)$ will be the solutions of the following system of algebraic
equations



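$$\begin{bmatrix}
x_1^{(n-1)}(t) & x_1^{(n-2)}(t) & \cdots & x_1'(t) & x_1(t) \\
x_2^{(n-1)}(t) & x_2^{(n-2)}(t) & \cdots & x_2'(t) & x_2(t) \\
\vdots & \vdots & & \vdots & \vdots \\
x_{n-1}^{(n-1)}(t) & x_{n-1}^{(n-2)}(t) & \cdots & x_{n-1}'(t) & x_{n-1}(t) \\
x_n^{(n-1)}(t) & x_n^{(n-2)}(t) & \cdots & x_n'(t) & x_n(t)
\end{bmatrix}
\begin{bmatrix} a_{n-1}(t) \\ a_{n-2}(t) \\ \vdots \\ a_1(t) \\ a_0(t) \end{bmatrix}
=
\begin{bmatrix} -x_1^{(n)}(t) \\ -x_2^{(n)}(t) \\ \vdots \\ -x_{n-1}^{(n)}(t) \\ -x_n^{(n)}(t) \end{bmatrix}$$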

In this particular case, the existence of the solutions of this system of equations is
guaranteed since the solutions $x_i(t)$ are linearly independent and therefore their
Wronskian will be different from zero for all t. In the formulation of a model with a set of
fully configurable modes, and depending on the expected dynamics of the system to be
modelled, the modes given in expressions (4) and (5) are excellent candidates to represent a
pair of complex conjugate poles whose exact locations are not known. On the other hand,
the function $x(t) = e^{-\int \sigma(t)\,dt}$ with $\sigma(t) > 0$ may be used to represent a real pole located in the
left half-plane. As will be seen, the proposed functions represent the best alternative to
build a system based on equation (11) with complex dynamical behaviour.
In previous work (Kaszyński, 2003; Jaskula & Kaszyński, 2004; Piskorowski, 2006), the
following equation was proposed for defining a new class of low-pass time-varying filters

$$x''(t) + 2\xi(t)\omega(t)x'(t) + \omega^2(t)x(t) = \omega^2(t)u(t) \qquad (12)$$

where u(t) is the signal to be filtered, x(t) represents the output signal and $\xi(t)$ and $\omega(t)$
represent time-varying parameters. If the coefficients of (12) are compared to the coefficients
defined in (7) and (8) for (1), it can be concluded that the system defined in (1) will
approximate the behaviour of (12) when the maximum magnitude of the logarithmic
derivative of function $\omega(t)$ and the maximum magnitude of function $\xi'(t)/(\xi^2(t) - 1)$ are
small compared to $2\xi(t)\omega(t)$ and $\omega(t)$, respectively. In other words, $\xi(t)$ and $\omega(t)$ have to vary
slowly.
3. Stability properties of the proposed model

The stability properties of (1) when $a_1(t)$ and $a_0(t)$ are defined according to (7) and (8) are
determined solely by the modes used in the formulation of these coefficients. In the general
case (i.e. when the time-varying parameters $\xi(t)$ and $\omega(t)$ vary arbitrarily), a suitable measure of
the stability of the modes given in (4) and (5) can be obtained if their Lyapunov exponents
are computed. The Lyapunov exponents $L_1$ and $L_2$ associated with modes $x_1(t)$ and $x_2(t)$ in (4)
and (5) are given by

$$L_1 = \limsup_{t \to \infty} t^{-1} \ln|x_1(t)| = \lim_{t \to \infty} \mathrm{Re}\; t^{-1} \int_0^t \omega(\tau)\left[-\xi(\tau) + (\xi^2(\tau) - 1)^{1/2}\right] d\tau \qquad (13)$$

$$L_2 = \limsup_{t \to \infty} t^{-1} \ln|x_2(t)| = \lim_{t \to \infty} \mathrm{Re}\; t^{-1} \int_0^t \omega(\tau)\left[-\xi(\tau) - (\xi^2(\tau) - 1)^{1/2}\right] d\tau \qquad (14)$$

In order to assure the stability of (1), $L_1 < 0$ and $L_2 < 0$ are required. Therefore, it suffices that the real part
of those time averages remains negative. This means that function $\xi(t)$ may take arbitrary
negative values and the stability of the system will still be guaranteed, provided that the real part of the
average value of $\omega(t)[-\xi(t) \pm (\xi^2(t) - 1)^{1/2}]$ is negative over the whole time interval in which
(1) is operating. This gives added flexibility when implementing a control for modulating
$\xi(t)$ and $\omega(t)$ according to a predefined rule.

An important issue behind the proposed model in (1) is the possibility that it might be
bounded-input bounded-output (BIBO) stable when the right-hand side of (1) contains a
non-zero term. In order to guarantee this type of stability, it suffices that the homogeneous
response of (1) is exponentially asymptotically stable and that the coefficients $a_0(t)$ and $a_1(t)$
are bounded for all t (Anderson & Moore, 1969). In this case, both conditions are satisfied
and therefore the proposed model has BIBO stability. The proposed modes in expressions
(4) and (5) have exponential asymptotic stability provided that $\omega(t)$ and $\xi(t)$ remain positive
for all t. In order to guarantee the boundedness of the coefficients of (1), $\omega(t)$ has to be
different from zero for all t, whereas $\xi(t)$ has to be different from 1 for all t.
In the particular case of the system governed by equation (11), the considerations given in
(Anderson & Moore, 1969) to guarantee its BIBO stability when the right-hand side of (11) is
non-zero are also applicable. It should be noticed that if the dynamics of the system should
include a real pole in the left half-plane whose location is not known, function $x(t) = e^{-\int \sigma(t)\,dt}$
also has exponential asymptotic stability provided that $\sigma(t) > 0$ for all t. Moreover, the average
value of $\sigma(t)$ will define the Lyapunov exponent associated with that solution.
The results obtained so far should be compared against previous work. In (Kaszyński, 2003),
the following conditions for assuring the stability of (12) were given

$$\omega(t) > 0 \qquad (15)$$

$$\xi(t) > 0 \qquad (16)$$

$$|\omega'(t)| < |2\xi(t)\omega^2(t)| \qquad (17)$$

Such conditions were obtained using the second Lyapunov method. Conditions (15) and
(16) are sufficient in (1) to ensure stability for all t. With these conditions, it is guaranteed
that the integrands in (13) and (14) always have a negative real part, and therefore the
Lyapunov exponents computed in this way will also be negative. Condition (17) imposes an
additional constraint on (12): in order to ensure stability, the rate of change of $\omega(t)$ may not exceed the
limit imposed by the product $2\xi(t)\omega^2(t)$, thus limiting the tunability range of this parameter.
System (1) with coefficients $a_1(t)$ and $a_0(t)$ as defined in (7) and (8) does not show the
shortcoming described above.

To further demonstrate that the dynamics of (1) and (12) are different, the sum of the
Lyapunov exponents will be computed. According to (Nemytskii & Stepanov, 1989), the
sum of the Lyapunov exponents $\nu$ of a regular n-th order LTV system of the form (11) is
given by

$$\nu = -\limsup_{t \to \infty} t^{-1} \int_0^t a_{n-1}(\tau)\, d\tau \qquad (18)$$

If the coefficient associated with the x'(t) term of (12) is compared to (7), it can be verified that
the only way to make the sum of Lyapunov exponents equal for both systems is to force the
product $2\xi(t)\omega(t)$ to be the same in both systems and to cancel the time variations of $\xi(t)$
and $\omega(t)$. This means that if the sum of Lyapunov exponents for (1) and (12) is meant to be
the same, both systems must be LTI and have the same set of eigenvalues.

4. Performance evaluation 

In order to verify the stability properties of the proposed model, its response will be tested
for a predefined test signal (a step function). Therefore, x(t) will be estimated numerically
from the following equation

$$x''(t) + a_1(t)x'(t) + a_0(t)x(t) = a_0(t)u(t) \qquad (19)$$

where coefficients $a_1(t)$ and $a_0(t)$ are defined as in (7) and (8) and u(t) is the step function.
The obtained response will be compared to the response obtained from a reference LTI
system and from the LTV filter based on equation (12) subject to the same parameter
variation. It will be assumed that the LTI reference filter is a second-order lowpass
Butterworth filter with poles located at -5 ± 5j. Furthermore, it will be assumed that $\omega(t)$
and $\xi(t)$ are given by the following expressions

$$\omega(t) = \left[50 + 50e^{-40t} + 100e^{-20t}\right]^{1/2} \qquad (20)$$

$$\xi(t) = (10 + 10e^{-20t})/2\omega(t) \qquad (21)$$

Although it was demonstrated long ago that the eigenvalues obtained
from an arbitrary LTV system by means of the well-known characteristic equation for LTI
systems do not convey any valid information regarding the stability properties of the system
or its solution (Vinogradov, 1952), with the definitions given for $\omega(t)$ and $\xi(t)$ in (20) and (21)
the instantaneous location of the roots of the characteristic equation associated with (12) is
modulated exponentially in time from -10 ± 10j at t = 0 to -5 ± 5j within a relatively short
time interval. With a time constant of 0.05 seconds for the modulation factor, the roots
associated with the characteristic equation of (12) should be very close to the poles of the
reference LTI system after 0.2 seconds.
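The root trajectory described above can be reproduced with a short sketch. It assumes that (20) reads $\omega(t) = [50 + 50e^{-40t} + 100e^{-20t}]^{1/2}$, which is the form consistent with roots moving from -10 ± 10j to -5 ± 5j with a 0.05 s time constant; the function names are illustrative.

```python
import cmath
import math


def omega(t):
    # Assumed form of (20); the exponent -40t is inferred from the stated root locations.
    return math.sqrt(50.0 + 50.0 * math.exp(-40.0 * t) + 100.0 * math.exp(-20.0 * t))


def xi(t):
    # Expression (21).
    return (10.0 + 10.0 * math.exp(-20.0 * t)) / (2.0 * omega(t))


def frozen_roots(t):
    """Roots of s^2 + 2*xi*omega*s + omega^2 = 0 with the coefficients frozen at time t."""
    w, z = omega(t), xi(t)
    disc = cmath.sqrt(z * z - 1.0)
    return w * (-z + disc), w * (-z - disc)


for t in (0.0, 0.05, 0.1, 0.2):
    p1, p2 = frozen_roots(t)
    print(f"t = {t:4.2f} s  p1 = {p1:.4f}  p2 = {p2:.4f}")
```

As the text stresses, these frozen-time roots only describe how the parameters are modulated; they are not a stability test for the LTV system.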

The simulated responses obtained in Mathematica for the LTI reference system, the filter
based on (12) and the proposed LTV model are depicted in Figure 3. From the
simulation results it can be seen that the responses of both LTV systems converge to
the response of the LTI reference system as t increases. Although the response generated by
(12) rises faster than the response generated by (19), the proposed model produces a
smaller overshoot.
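A comparable experiment can be sketched without Mathematica. The code below integrates (19) and (12) for a unit step with a classical fixed-step Runge-Kutta scheme, which is a stand-in chosen for this illustration and not the solver used by the authors. It assumes the exponent -40t in (20), zero initial conditions, and coefficients $a_1(t) = \xi[2\omega - \xi'/(\xi^2 - 1)] - \omega'/\omega$ and $a_0(t) = \omega[\omega - \xi'/(\xi^2 - 1)]$ with derivatives taken by finite differences.

```python
import math


def omega(t):
    return math.sqrt(50.0 + 50.0 * math.exp(-40.0 * t) + 100.0 * math.exp(-20.0 * t))


def xi(t):
    return (10.0 + 10.0 * math.exp(-20.0 * t)) / (2.0 * omega(t))


def deriv(f, t, h=1e-6):
    return (f(t + h) - f(t - h)) / (2.0 * h)


def coeffs_proposed(t):
    """(a1, a0, input gain) for equation (19)."""
    w, z = omega(t), xi(t)
    q = deriv(xi, t) / (z * z - 1.0)
    a1 = z * (2.0 * w - q) - deriv(omega, t) / w
    a0 = w * (w - q)
    return a1, a0, a0


def coeffs_filter12(t):
    """(a1, a0, input gain) for equation (12)."""
    w, z = omega(t), xi(t)
    return 2.0 * z * w, w * w, w * w


def step_response(coeffs, t_end=1.5, dt=1e-3):
    """RK4 integration of x'' + a1 x' + a0 x = gain * 1 from rest."""
    def rhs(t, x, v):
        a1, a0, gain = coeffs(t)
        return v, gain - a1 * v - a0 * x

    t, x, v = 0.0, 0.0, 0.0
    samples = [x]
    for _ in range(int(round(t_end / dt))):
        k1x, k1v = rhs(t, x, v)
        k2x, k2v = rhs(t + dt / 2, x + dt / 2 * k1x, v + dt / 2 * k1v)
        k3x, k3v = rhs(t + dt / 2, x + dt / 2 * k2x, v + dt / 2 * k2v)
        k4x, k4v = rhs(t + dt, x + dt * k3x, v + dt * k3v)
        x += dt / 6 * (k1x + 2 * k2x + 2 * k3x + k4x)
        v += dt / 6 * (k1v + 2 * k2v + 2 * k3v + k4v)
        t += dt
        samples.append(x)
    return samples


y_proposed = step_response(coeffs_proposed)
y_filter12 = step_response(coeffs_filter12)
print("final values:", y_proposed[-1], y_filter12[-1])
print("peak values: ", max(y_proposed), max(y_filter12))
```

Printing the peak values allows the overshoot of the two responses to be compared directly.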

In Figure 4, the response to the step function of the systems defined by (19) and (12) is
depicted. In this particular case, it was assumed that $\omega(t)$ and $\xi(t)$ were varied according to
the following relations

$$\omega(t) = \left[50 + 90\sin(10\pi t) + 40.5\sin^2(10\pi t)\right]^{1/2} \qquad (22)$$

$$\xi(t) = [10 + 9\sin(10\pi t)]/2\omega(t) \qquad (23)$$



Figure 3. Responses of (12) and (19) to a step input with an exponential variation of $\omega(t)$
and $\xi(t)$. In this figure, the acronym LTV stands for the response of (12) whereas the
acronym LTV2 stands for the response of (19)

With the chosen variation of $\omega(t)$ and $\xi(t)$, the roots of the characteristic
equation associated with (12) will vary periodically according to the following relation

$$p_{1,2} = -5 \pm 5j + (-4.5 \pm 4.5j)\cos(10\pi t) \qquad (24)$$

where $p_1$ and $p_2$ stand for the roots of the characteristic equation of (12). With this variation
scheme, the condition given in (17) for (12) is no longer satisfied. However, in the case of
(19), the stability measures given in (13) and (14) for equation (1) ($L_1 = L_2 = -5$), the
exponential asymptotic nature of its homogeneous response and the boundedness of its
coefficients guarantee that the output of (19) will be bounded (Anderson & Moore, 1969).
Given that (12) is expected to show an unstable behaviour, backward differentiation
formulae were chosen to solve equations (19) and (12) numerically. From the results of the
simulation it can be seen that both LTV systems reach their steady-state
behaviour in a finite time. However, the system described by (12) displays a somewhat larger overshoot
before it reaches steady-state behaviour.
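Both statements above (the value of the Lyapunov exponents and the violation of condition (17)) can be checked with a few lines of code. The sketch uses the periodic parameters $\omega(t) = [50 + 90\sin(10\pi t) + 40.5\sin^2(10\pi t)]^{1/2}$ and $\xi(t) = [10 + 9\sin(10\pi t)]/2\omega(t)$, averages the real part of the mode exponent over five full periods as a finite-horizon estimate of (13) and (14), and tests (17) pointwise with a finite-difference derivative. Function names, the averaging horizon and the step sizes are assumptions of this illustration.

```python
import cmath
import math


def omega(t):
    s = math.sin(10.0 * math.pi * t)
    return math.sqrt(50.0 + 90.0 * s + 40.5 * s * s)


def xi(t):
    return (10.0 + 9.0 * math.sin(10.0 * math.pi * t)) / (2.0 * omega(t))


def lyapunov_estimate(sign, t_end=1.0, n=20000):
    """Midpoint-rule time average of Re{omega * (-xi +/- sqrt(xi^2 - 1))} over [0, t_end]."""
    dt = t_end / n
    total = 0.0
    for k in range(n):
        t = (k + 0.5) * dt
        growth = omega(t) * (-xi(t) + sign * cmath.sqrt(xi(t) ** 2 - 1.0))
        total += growth.real
    return total / n


def condition_17_holds(t, h=1e-6):
    """Pointwise check of |omega'(t)| < |2 * xi(t) * omega(t)^2|."""
    d_omega = (omega(t + h) - omega(t - h)) / (2.0 * h)
    return abs(d_omega) < abs(2.0 * xi(t) * omega(t) ** 2)


print(lyapunov_estimate(+1), lyapunov_estimate(-1))       # both close to -5
print(condition_17_holds(0.0), condition_17_holds(0.05))  # violated at t = 0
```

With these parameters the damping $\xi(t)$ evaluates to a constant close to 0.707, so the time variation enters only through $\omega(t)$.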

It is worth considering the role of the scaling coefficient of the input function u(t) in
equations (12) and (19). According to condition (17), the output of equation (12) should be
unstable if the parameters given in expressions (22) and (23) are considered. However, the
simulation results of Figure 4 show that the output of the filter described by equation (12) is
stable. If the scaling coefficient of the input function u(t) present in equation (12) is removed
and the following equation is considered instead

$$x''(t) + 2\xi(t)\omega(t)x'(t) + \omega^2(t)x(t) = u(t) \qquad (25)$$

it is possible to demonstrate that the stability condition given in (17) is indeed broken.

Figure 4. Responses of (12) and (19) to a step input with a periodic variation of $\omega(t)$ and
$\xi(t)$.

Figure 5. Responses of (25) and (26) to a step input with a periodic variation of $\omega(t)$ and
$\xi(t)$

In Figure 5, the response of equation (25) with periodic coefficients $\omega(t)$ and $\xi(t)$ as defined in
(22) and (23) to a unit step function is depicted. This response is compared to the response of
the LTV differential equation

$$x''(t) + a_1(t)x'(t) + a_0(t)x(t) = u(t) \qquad (26)$$

and to the response of the lowpass filter described by the following LTI differential equation

$$x''(t) + 50x'(t) + 10x(t) = u(t) \qquad (27)$$

In expression (26), coefficients $a_0(t)$ and $a_1(t)$ are defined as given in (8) and (7). From Figure
5 it may be noticed that the responses of equations (25) and (26) are oscillatory and do not
track the input function u(t). In this case, function u(t) is assumed to be a step function.
Figure 6. Time series considered for the output of (25)

Figure 7. Time series considered for the output of (26)

A detailed analysis of a portion of the time series obtained from (25) and (26) gives more
information on the results obtained through simulation with Mathematica. The time series
considered for analysis appear in Figures 6 and 7. As a first attempt, Lyapunov exponents
may be determined to establish the nature of the behaviour displayed by the systems
represented by equations (25) and (26). However, given the periodic character of their time
series, other methods should be used to assess the stability of (25) and (26) (Sprott, 2003).
In order to determine whether the stability condition given in (17) is broken or not for
equation (25), the power spectra of the time series considered should be obtained. According
to (Sigeti, 1995), the power spectrum of a signal with positive Lyapunov exponents
has an exponential high-frequency falloff. This characteristic in the frequency
domain is due to the fact that the function which defines the signal under consideration has
singularities in the complex plane when the time variable t is seen as a complex variable and
not as a real one (Sigeti, 1995). When the Fourier transform is computed for such a signal,
the singularities must be avoided in the complex plane through an adequate integration
path, and in this way exponential terms appear in its Fourier transform (Sigeti,
1995). In the presence of noise, the exponential frequency falloff will be
noticeable up to a given frequency and afterwards the spectrum will decay as a power $f^{-n}$, where f is
the frequency and n a natural number (Lipton & Dabke, 1996). These phenomena are also
observable in chaotic systems, independently of the appearance of attractors
in their dynamic behaviour (van Wyk & Steeb, 1997). When there are no singularities in the
complex variable t present in a given signal and in the absence of noise, its power spectrum
will also decay at high frequencies as a power $f^{-n}$ (Sigeti & Horsthemke, 1987).
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1.02782 x 10-9 



Table 1. Discrete power spectrum for the output of equation (25) 



Frequency (Hz)    Magnitude
0                 8.78076 x 10^-2
5                 4.50563 x 10^-3
10                8.13031 x 10^-4
15                5.94339 x 10^-5
20                7.58989 x 10^-6
25                2.38456 x 10^-6
30                8.0822 x 10^-7
35                3.04263 x 10^-7
40                1.23233 x 10^-7
45                5.27326 x 10^-8
50                2.35502 x 10^-8
55                1.08831 x 10^-8
60                5.1718 x 10^-9

Table 2. Discrete power spectrum for the output of equation (26)

Given that the numerical solutions obtained for equations (25) and (26) are periodic, their 
power spectra turn out to be discrete. In Tables 1 and 2 the magnitude of the harmonic 
components of the responses computed via Mathematica has been tabulated. The data given 
in Table 1 was used to obtain the best fit in Mathematica using routine NonlinearFit[ ] for 
expressions 

$$ y = A_1 e^{B_1 f} \qquad (28) $$

and

$$ y = A_2 f^{B_2} \qquad (29) $$

where $A_1$, $A_2$, $B_1$ and $B_2$ are fitting parameters. If the data given in Table 1 is considered from
f = 15 Hz for the fitting process, the constants $A_1$ and $B_1$ which best fit expression (28) are
equal to 0.535465 and -0.810056 respectively. With the same data, the constants obtained for
the best fit of expression (29) are $A_2$ = 175010 and $B_2$ = -9.17964. In Figure 8 expressions (28)
and (29) are plotted together with the original data and it can be seen that the exponential
curve better matches the data obtained from equation (25) at high frequencies. The same
procedure was carried out with the data presented in Table 2. The coefficients obtained for
expression (28) were $A_1$ = 0.0244961 and $B_1$ = -0.401458 whereas for expression (29) the
coefficients were $A_2$ = 10292.2 and $B_2$ = -7.00509. From Figure 9 it can be seen that
expression (29) gives the best fit for the data obtained from equation (26) at frequencies
greater than 15 Hz. Given that condition (17) is broken, it can thus be safely concluded that
the response obtained from equation (25) is unstable when $\omega(t)$ and $\xi(t)$ are defined as given
in expressions (23) and (24). In the case of equation (26), under the same conditions, it turns
out that its response is bounded. The response of equation (26) is bounded for a bounded
input because the conditions given in (Anderson & Moore, 1969) for BIBO stability are
enforced.

Figure 8. Power spectrum obtained for the time series of (25)

Figure 9. Power spectrum obtained for the time series of (26)
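The comparison between the exponential law (28) and the power law (29) can be sketched as follows. This is only an illustrative sketch in Python, not the Mathematica NonlinearFit[ ] procedure used above: both laws are fitted by ordinary least squares in logarithmic coordinates, so the resulting parameters are not expected to coincide with the values quoted in the text. The data are the Table 2 magnitudes from 15 Hz upwards, assuming a 5 Hz spacing of the harmonics; the helper names `linear_fit` and `log_residual` are hypothetical.

```python
import math

# Harmonic magnitudes of Table 2 for f >= 15 Hz (5 Hz spacing is an assumption).
freqs = [15, 20, 25, 30, 35, 40, 45, 50, 55, 60]
mags = [5.94339e-5, 7.58989e-6, 2.38456e-6, 8.0822e-7, 3.04263e-7,
        1.23233e-7, 5.27326e-8, 2.35502e-8, 1.08831e-8, 5.1718e-9]


def linear_fit(xs, ys):
    """Ordinary least-squares line ys ~ slope * xs + intercept."""
    n = len(xs)
    mx = sum(xs) / n
    my = sum(ys) / n
    sxx = sum((x - mx) ** 2 for x in xs)
    sxy = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    slope = sxy / sxx
    return slope, my - slope * mx


def log_residual(model, xs, ys):
    """Sum of squared differences between ln(model(x)) and ln(y)."""
    return sum((math.log(model(x)) - math.log(y)) ** 2 for x, y in zip(xs, ys))


log_mags = [math.log(m) for m in mags]

# Exponential law (28): ln y = ln A1 + B1 * f
B1, ln_A1 = linear_fit(freqs, log_mags)
A1 = math.exp(ln_A1)

# Power law (29): ln y = ln A2 + B2 * ln f
B2, ln_A2 = linear_fit([math.log(f) for f in freqs], log_mags)
A2 = math.exp(ln_A2)

res_exp = log_residual(lambda f: A1 * math.exp(B1 * f), freqs, mags)
res_pow = log_residual(lambda f: A2 * f ** B2, freqs, mags)

print("exponential law: A1 = %.6g, B1 = %.6g, log-residual = %.6g" % (A1, B1, res_exp))
print("power law:       A2 = %.6g, B2 = %.6g, log-residual = %.6g" % (A2, B2, res_pow))
```

The law with the smaller residual describes the high-frequency falloff better in the log-domain sense; a fit in the original (linear) coordinates weights the first harmonics much more heavily and may rank the two laws differently.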

5. Conclusions 

In this chapter, a strategy for the formulation of an LTV scalar dynamical system with
predefined dynamic behaviour was presented. Moreover, a model of a second-order LTV
system whose dynamic response is fully adaptable was presented. It was demonstrated that
the proposed model has an exponentially asymptotically stable behaviour provided that a set
of stability constraints is observed. Moreover, it was demonstrated that the obtained
system is BIBO stable as well. Finally, it was shown via simulations that the response of the
proposed model reaches its steady-state response with a smaller overshoot compared to the
response of an LTV lowpass filter proposed previously.

1. Introduction

Control limits are ubiquitous in the real world, in any application, thus taking them into
consideration is of prime importance if one aims to achieve high performance of the control
system. One can abide by constraints by means of two approaches. The first is to impose
constraints directly at the design stage of the controller, which usually leads to problems with
obtaining explicit forms (or closed-form expressions) of control laws, apart from very simple
cases, e.g. quadratic performance indexes. The other approach is based on assuming that the
system is linear and, having imposed constraints on the controller output (designed for the
unconstrained case by means of optimisation, using Diophantine equations, etc.), one has to
introduce necessary amendments to the control system because of possibly active
constraints (Horla, 2004a; Horla, 2007d; Ohr, 2003; Peng et al., 1998).

When internal controller states do not correspond to the actual signals present in the control
system because of constraints or, in general, a nonlinearity at the controller output, such a
situation is referred to in the literature as the windup phenomenon (Dona et al., 2000; Horla,
2004a; Ohr, 2003). If control signal constraints are not taken into account
during the controller design stage, one can expect inferior performance because the
computed control signals are infeasible.

Many methods of anti-windup compensation are known from the single-input single-output
framework, but only a few work well enough in the case of multivariable systems (Horla, 2004a;
Horla, 2004b; Horla, 2006a; Horla, 2006b; Horla, 2006c; Horla, 2007a; Horla, 2007b; Horla,
2007c; Horla, 2007d; Ohr, 2003; Peng et al., 1998; Walgama & Sternby, 1993).
Multivariable systems have an additional feature: the windup phenomenon is tightly connected
with the directional change phenomenon in the control vector, because different implementations
of constraints affect the direction of the computed control vector. Even for a
simple amplitude-constrained case, the constrained control vector $u_t$ may have a different
direction than the computed control vector $v_t$. The situation is even more complicated for
an amplitude- and rate-constrained system, where for additional requirements, e.g. keeping
constant direction, there may be no appropriate control action to be taken (Horla, 2007d).

Apart from windup and directional change phenomena one can expect to obtain inferior
performance because of problems with (dynamic) decoupling, especially when the plant
does not have equal numbers of control signals and output signals (Albertos & Sala, 2004;
Maciejowski, 1989). In such a case, control direction corresponds not only to input principal
directions (or maximal directional gain of the transfer function matrix), but also to the
degree of decoupling, and by altering it one can achieve better decoupling (though not in all
cases).

Directional change has been discussed in (Ohr, 2003; Walgama & Sternby, 1993), where
the first description of the problem was given and connections between anti-windup
compensation and directional change were made. A review of multivariable anti-windup
compensators has been included in (Horla, 2007b; Peng et al., 1998; Walgama &
Sternby, 1993) with a basic analysis of the topic.

The windup phenomenon (and thus decoupling and directional change) is tightly connected with
industrial applications and is crucial when control laws are to be applied. Many papers have treated
the application of anti-windup compensation (AWC) in areas such as motor drives, paper machine
headbox or hydraulic drives control, but they do not explain the connection
between directional change and AWC.

To recapitulate, when control limits are taken into consideration, the presence of the windup
phenomenon requires certain actions to compensate for it, i.e., to retrieve the correspondence of
the internal controller states with its (vector) output. A posteriori anti-windup compensators
perform heuristic modifications that feed back to the controller the portion of controls changed by
the nonlinearity, whereas a priori AWCs enable windup phenomenon avoidance (i.e., a
priori compensation) by generating feasible control actions only (Dona et al., 2000; Horla,
2006a; Horla, 2006b).

Having avoided generation of infeasible control actions, one avoids the windup phenomenon in
an a priori manner, and implicitly eliminates the windup that would inevitably take
place had control limits not been taken into consideration first.

The chapter focuses on directional change issues (and, simultaneously, anti-windup
compensation) in a multivariable state-feedback controller with an a priori anti-windup
compensator for systems given in state-space form. The problem is presented through the
framework of linear matrix inequalities (LMIs). Imposing only amplitude constraints on the
control vector results in LMI conditions, but taking rate constraints into consideration
results in a nonsymmetric matrix inequality, which is transformed into an LMI by making certain
assumptions, as in (Horla, 2007a).

2. Control vector constraints and directional change 

Let us suppose that amplitude constraints are imposed on the input signals of a two-input
plant. Depending on the method of imposing constraints one can observe directional
change, illustrated in Fig. 1a for the case of cut-off saturation, which is not present when
saturation is performed according to the imposed constraints (dashed lines) with constant
direction, Fig. 1b.

The situation is more complicated when rate constraints are taken into consideration. Let
$\mathcal{D}_\alpha$ denote the set of all feasible control vectors due to amplitude constraints and $\mathcal{D}_\beta$ denote
the set of all feasible control vectors due to rate constraints. If $\mathcal{D}_\alpha \cap \mathcal{D}_\beta \neq \emptyset$ then the constrained
input vector is feasible. The aim is to constrain the control vector so that as much of its
primal information is kept with minimum directional change (Horla, 2007c).
Let the computed control vector violate amplitude constraints and have the property that its
amplitude-constrained companion does not violate rate constraints, i.e. $u_t \in \mathcal{D}_\beta$ (see Fig. 2a).
The necessary condition here for control direction to be preserved is as above; in such a
case only two sets, a point $(u_{1,t}, u_{2,t})$ in the plane and $\mathcal{D}_\beta$, have a common part.

Fig. 1. a) direction-changing, b) direction-preserving saturation (left: control vector before
saturation, right: after saturation)

When the point at the end of the direction-preserved, amplitude-constrained, computed control
vector and $\mathcal{D}_\beta$ do not have a common part, then it is impossible to generate feasible control
actions with both amplitude and rate constraints imposed so that the direction of the computed
control vector is sustained. This is depicted in Figure 2b, where the only constrained control
vector that lies "as close as possible" to the computed control vector satisfies $u_t \in \mathcal{D}_\alpha \cap \mathcal{D}_\beta$ and
$u_t \in b(\mathcal{D}_\beta)$ (the boundary of $\mathcal{D}_\beta$). When rate constraints are violated, one has to treat them either as
secondary constraints (to be omitted) or introduce soft rate constraints instead of hard rate
constraints (Maciejowski, 1989).



3. Directional change phenomenon, an example 

Let a two-input two-output system be not coupled and both loops be driven by separate
controllers (with no cross-coupling). The system output $y_t$ is to track a reference vector
comprising two sinusoid waves, which corresponds to drawing a circular shape in the $(y_1, y_2)$
plane.

Fig. 2. a) direction-preserved saturation, b) saturation with directional change

Fig. 3. a) unconstrained system, b) cut-off saturation, c) direction-preserving saturation

As can be seen in Fig. 3a, the unconstrained system performs best, whereas in the case
of cut-off saturation imposed on both elements of the control vector (Fig. 3b) the tracking
performance is poor. This is because of the directional change in controls, which changes
the proportions between its components. In an application such as shape-cutting, the performance
of the system from Fig. 3c (direction-preserving saturation) is superior. Furthermore, in
order to achieve such a performance the system must be perfectly decoupled at all times
(Horla, 2007b; Ohr, 2003).
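The difference between the two saturation schemes of Fig. 1 can be illustrated with a short Python sketch. It assumes symmetric amplitude limits per component and a two-element control vector; the function names `cutoff_saturation`, `direction_preserving_saturation` and `angle_deg` are hypothetical and are not taken from the cited works.

```python
import math


def cutoff_saturation(v, limits):
    """Clip every component independently; the direction may change."""
    return [max(-a, min(a, vi)) for vi, a in zip(v, limits)]


def direction_preserving_saturation(v, limits):
    """Scale the whole vector by one common factor so that all
    components become feasible; the direction is kept."""
    scale = 1.0
    for vi, a in zip(v, limits):
        if abs(vi) > a:
            scale = min(scale, a / abs(vi))
    return [scale * vi for vi in v]


def angle_deg(v):
    """Direction of a two-element vector in degrees."""
    return math.degrees(math.atan2(v[1], v[0]))


v = [4.0, 1.0]        # computed (unconstrained) control vector
limits = [2.0, 2.0]   # symmetric amplitude limits, assumed for illustration

u_cut = cutoff_saturation(v, limits)
u_dir = direction_preserving_saturation(v, limits)

print("cut-off:              ", u_cut, "direction change [deg]:",
      abs(angle_deg(u_cut) - angle_deg(v)))
print("direction-preserving: ", u_dir, "direction change [deg]:",
      abs(angle_deg(u_dir) - angle_deg(v)))
```

In this example only the first component violates its limit. Cut-off saturation clips it alone and so changes the proportion between the two components, while the direction-preserving variant shrinks both components by the same factor.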





Fig. 4. Results from the closed-loop system a) without AWC, b) with AWC

4. Anti-windup compensation, an example 

An example action of AWC is shown in Figure 4, where for hard constraints imposed on the
control signal and a pole-placement controller it is impossible to ensure tracking properties if
the windup phenomenon is left uncompensated. On the other hand, by performing
compensation, the control signal is desaturated and is not prone to consecutive
resaturations, operating for a period of time in the linear zone.



5. How to understand windup phenomenon in multivariable systems - 
literature remarks 

The problem of the windup phenomenon in multivariable systems and its connection to
directional change in controls has rarely been addressed in the literature. The only valuable
remark concerning directional change is in (Walgama & Sternby, 1993):

Solving the windup phenomenon problem does not mean that the constrained control vector is of the
same direction as the computed control vector.

On the other hand, avoiding directional change in control enables one to avoid the windup phenomenon.

In further parts of this chapter, it will be described when the latter statement holds.

6. Considered plant model

The directional change issues are discussed for the state-feedback control law that has been
derived for the shifted-input $u_t^s \in \mathbb{R}^m$ and shifted-output $y_t^s \in \mathbb{R}^p$ plant in the CARMA
structure, taking into account the offset resulting from the current set-point vector for plants
without integration (in a steady-state)

$$ A(q^{-1})\, y_t^s = B(q^{-1})\, u_{t-d}^s, \qquad (1) $$

represented in state-space form for non-shifted (original) inputs and outputs:

$$ x_{t+1} = A x_t + B u_{t-d}, \qquad (2) $$

$$ y_t = C x_t, \qquad (3) $$

with

$$ A = \begin{bmatrix} -A_1 & I & \cdots & 0 \\ \vdots & \vdots & \ddots & \vdots \\ -A_{n-1} & 0 & \cdots & I \\ -A_n & 0 & \cdots & 0 \end{bmatrix}, \qquad (4) $$

$$ B = [B_1^T, \ldots, B_n^T]^T, \qquad (5) $$

$$ C = [I, 0, \ldots, 0], \qquad (6) $$

$$ A(q^{-1}) = I + A_1 q^{-1} + \cdots + A_n q^{-n}, \qquad (7) $$

$$ B(q^{-1}) = B_1 + B_2 q^{-1} + \cdots + B_n q^{-n+1}. \qquad (8) $$



The aim of the state-feedback controller is to track a given reference vector $r_t \in \mathbb{R}^p$ with the plant
output vector $y_t$, minimising a certain performance index. The offset previously mentioned is:

$$ x_t = x_t^s + x_\infty, \qquad (9) $$

$$ u_t = u_t^s + u_\infty, \qquad (10) $$

$$ y_t = y_t^s + y_\infty, \qquad (11) $$

where for plants without integral terms

$$ x_\infty = (I - A)^{-1} B u_\infty, \qquad (12) $$

$$ y_\infty = r_t, \qquad (13) $$

with $u_\infty \in \mathbb{R}^m$ being the control vector value assuring tracking in steady state, i.e.

$$ A(1)\, r_t = B(1)\, u_\infty. \qquad (14) $$

For plants comprising integral terms there is no need to shift states because $u_\infty = 0$ holds. In
the case of $m \neq p$, the control vector offset results from (14) by means of the pseudoinverse.
The considered state-feedback controller enables one to constrain the part of $u_t$ which is
related to the answer of the plant to non-zero initial conditions at instant t, which
subsequently corresponds to the current values of the reference vector. In other words, it
enables constraining the absolute value of the elements of $u_t$ that are above the absolute value
of $u_\infty$.

In order to fully analyze the interplay of the directional change phenomenon with AWC, rate
vector constraints $\Delta u_t = u_t - u_{t-1}$ are also to be taken into account here (Horla, 2007c; Horla,
2007a).

7. State-feedback controller 

At each time instant the optimal state-feedback matrix is generated (Horla, 2006b)

$$ F_t^{*} : \; J_t(F_t^{*}) = \min_{F_t \in \mathcal{F}} \{ J_t(F_t) \} \qquad (15) $$

assuming that $r_t$ is of constant value $r$, $u_t = F_t x_t$, the state vector is perfectly measured and
constraints are imposed on the control vector. The set $\mathcal{F}$ denotes all $F_t$ for which the
performance index $J_t$ is of finite value.

At each time instant the optimisation procedure of $F_t^{*}$ is performed for the current values of the
reference vector, and at the next instant the procedure is repeated again.

The performance index

$$ J_t = \sum_{i=t}^{\infty} \left( \| x_i \|_{Q_1}^2 + \| u_i \|_{R_1}^2 \right), \qquad (16) $$

where $\| x \|_{Q_1}^2 = x^T Q_1 x$, is a quadratic function and is minimised with respect to $F_t$, with the state-feedback control law
$u_t = F_t x_t$. As has already been said, the plant state is fully accessible (if not, one can perform
estimation on the basis of the separation theorem).

In order to minimise the performance index (16), its upper bound is to be found. Let the
following Lyapunov function be given

$$ V(x_t) = x_t^T P x_t, \qquad (17) $$

with positive definite P > 0, and V(0) = 0. Having assumed that at each time instant t there holds

$$ V(x_{t+1}) - V(x_t) \leq -\left( x_t^T Q_1 x_t + u_t^T R_1 u_t \right), \qquad (18) $$

whose left- and right-hand sides, summed from $i = t$ to $i \to \infty$, satisfy

$$ -V(x_t) \leq -J_t, \qquad (19) $$

the performance index (16) is bounded from above with

$$ J_t \leq V(x_t), \qquad (20) $$

from where using (17) one obtains

$$ J_t \leq x_t^T P x_t. \qquad (21) $$
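The bound (20)-(21) can be checked numerically on a scalar example. The following Python sketch is an illustration only, with arbitrarily assumed values of the plant, the feedback gain and the weights (no delay, scalar state); the function name `closed_loop_cost` is hypothetical. For a scalar system, inequality (18) holds for all $x_t$ when $p \geq (q_1 + r_1 f^2) / (1 - (a + b f)^2)$, and the smallest such $p$ makes the bound tight.

```python
def closed_loop_cost(a, b, f, q1, r1, x0, steps=200):
    """Accumulate the quadratic cost (16) along the closed-loop
    trajectory x_{t+1} = (a + b*f) * x_t with u_t = f * x_t."""
    x, cost = x0, 0.0
    for _ in range(steps):
        u = f * x
        cost += q1 * x * x + r1 * u * u
        x = a * x + b * u
    return cost


# Assumed scalar data (not taken from the chapter).
a, b, f = 0.9, 1.0, -0.4      # closed-loop pole a + b*f = 0.5
q1, r1 = 1.0, 1.0
x0 = 2.0

a_cl = a + b * f
p_min = (q1 + r1 * f * f) / (1.0 - a_cl ** 2)   # smallest p satisfying (18)
p_loose = 2.0                                   # any larger p also satisfies (18)

J = closed_loop_cost(a, b, f, q1, r1, x0)
V_tight = p_min * x0 ** 2
V_loose = p_loose * x0 ** 2

print("J =", J, " V(p_min) =", V_tight, " V(p_loose) =", V_loose)
```

With the smallest admissible $p$ the value $V(x_0)$ practically coincides with the accumulated cost, and any larger $p$ gives a more conservative upper bound. This is why $\gamma$, which bounds $x_t^T P x_t$, is minimised in what follows.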

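Minimisation of the quadratic form $x_t^T P x_t$, with P > 0, on the basis of the Schur complement
for $x_t^T P x_t \leq \gamma$ is equivalent to the linear matrix inequality (LMI) (Boyd et al., 1994; Boyd and
Vandenberghe, 2004)

$$ \min \gamma \quad \text{s.t.} \quad \begin{bmatrix} 1 & * \\ x_t & Q \end{bmatrix} \geq 0 \qquad (22) $$

for $\gamma > 0$ and $Q = \gamma P^{-1}$.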

Having substituted (17) into (18)

$$ x_{t+1}^T P x_{t+1} - x_t^T P x_t \leq -\left( x_t^T Q_1 x_t + u_t^T R_1 u_t \right), \qquad (23) $$

using (2), and putting $u_t = F x_t$ one can write

$$ ((A + BF) x_t)^T P ((A + BF) x_t) - x_t^T P x_t + x_t^T Q_1 x_t + (F x_t)^T R_1 (F x_t) \leq 0, \qquad (24) $$

whose negative semi-definiteness is equivalent to

$$ (A + BF)^T P (A + BF) - P + F^T R_1 F + Q_1 \leq 0. \qquad (25) $$



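Putting $P = \gamma Q^{-1}$, pre- and post-multiplying with $Q^T$ and $Q$, and putting $Y = FQ$, the
inequality can be rewritten as

$$ Q - \frac{1}{\gamma} Y^T R_1 Y - \frac{1}{\gamma} Q^T Q_1 Q - (AQ + BY)^T Q^{-1} (AQ + BY) \geq 0. \qquad (26) $$

Applying the Schur complement again one obtains

$$ \begin{bmatrix} Q - \frac{1}{\gamma} Y^T R_1 Y - \frac{1}{\gamma} Q^T Q_1 Q & * \\ AQ + BY & Q \end{bmatrix} \geq 0, $$

$$ \begin{bmatrix} Q & * \\ AQ + BY & Q \end{bmatrix} - \frac{1}{\gamma} \begin{bmatrix} Q^T Q_1 Q + Y^T R_1 Y & * \\ 0 & 0 \end{bmatrix} \geq 0, $$

$$ \begin{bmatrix} Q & * \\ AQ + BY & Q \end{bmatrix} - \begin{bmatrix} Q^T Q_1^{1/2} & Y^T R_1^{1/2} \\ 0 & 0 \end{bmatrix} \begin{bmatrix} \gamma I & 0 \\ 0 & \gamma I \end{bmatrix}^{-1} \begin{bmatrix} Q_1^{1/2} Q & 0 \\ R_1^{1/2} Y & 0 \end{bmatrix} \geq 0. \qquad (27) $$

Having applied the Schur complement again, the optimal control law minimising (17) is
given as

$$ u_t = F_t x_t, \qquad (28) $$

with (time index has been omitted)

$$ F = Y Q^{-1}, \qquad (29) $$

where $Q > 0$ and $Y$ are solutions of

$$ \min \gamma \qquad (30) $$

s.t.

$$ \begin{bmatrix} 1 & * \\ x_t & Q \end{bmatrix} \geq 0, \qquad (31) $$

$$ \begin{bmatrix} Q & * & * & * \\ AQ + BY & Q & * & * \\ Q_1^{1/2} Q & 0 & \gamma I & * \\ R_1^{1/2} Y & 0 & 0 & \gamma I \end{bmatrix} \geq 0. \qquad (32) $$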



One can also take symmetrical control vector constraints into account. Let
$\| u_t \|_{\max} = \max_i | u_{i,t} |$,
$1 \leq i \leq m$, with $u_t \in \mathbb{R}^m$. If there exist $\gamma$, $Q$, $Y$ satisfying (30)-(32), then there holds
$\| x_t \|_P \leq \rho$,
and every state $x_{t+j}$ ($j > 0$), with $\| x_{t+j} \|_P \leq \rho$, lies inside the invariant ellipsoid $\mathcal{E}_\rho = \{ z : \| z \|_P \leq \rho \}$,
with $\rho = \sqrt{\gamma}$ for an invariant reference vector (Horla, 2007a; Horla, 2006b).

Imposing symmetrical constraints on amplitudes of elements of $u_t$ is equivalent to imposing
constraints on

$$ \| u_t \|_{\max} = \| F x_t \|_{\max} = \| Y Q^{-1} x_t \|_{\max} \leq \max_{x} \| Y Q^{-1} x \|_{\max}, \qquad (33) $$

$$ \| u_t \|_{\max}^2 \leq \max_{x} \| Y Q^{-1} x \|_{\max}^2 = \max_i \, (Y Q^{-1} Y^T)_{ii}. \qquad (34) $$

Then imposing different constraints on each of the elements of the control vector is equivalent
to the LMI

$$ \begin{bmatrix} \Lambda_\alpha & * \\ Y^T & Q \end{bmatrix} \geq 0, \qquad (35) $$

where $\Lambda_\alpha = \mathrm{diag}\{ \alpha_1^2, \ldots, \alpha_m^2 \}$ is a diagonal matrix comprising squares of amplitude
constraints of $u_{1,t}, \ldots, u_{m,t}$ on its diagonal.
The amplitudes of elements of the control vector $u_t$ are constrained if the above LMI is
added to the set of LMIs (30)-(32).

The presented state-feedback control law stabilises the closed-loop system, which is guaranteed by
the existence of the invariant ellipsoid describing $V(x_t)$ with a fully known plant and constant
reference vector.

In a similar manner, one can impose rate-of-change constraints. If $x_t \approx x_{t-1}$, i.e. the state vector is of
approximately constant value or in a steady state, then

$$ \Delta u_t = F x_t - F_{t-1} x_{t-1} = (F - F_{t-1}) x_t, \qquad (36) $$

where $F_{t-1}$ is the matrix computed at the previous sample and $F$ is the sought matrix computed at
the current time instant.

As stated in (Horla, 2007a), imposing symmetrical constraints on rates of
elements of $u_t$ is equivalent to imposing constraints on

$$ \| \Delta u_t \|_{\max}^2 \leq \max_{x} \| (Y Q^{-1} - F_{t-1}) x \|_{\max}^2 = \max_i \, (Y Q^{-1} Y^T - F^{*})_{ii}, \qquad (37) $$

where $F^{*} = F_{t-1} Y^T$ should be a symmetrical matrix.
Assuming that $F_{t-1} Y^T \approx F_{t-1} Y_{t-1}^T$ holds, one can obtain an approximate (rough) rate
constraint as

$$ \| \Delta u_t \|_{\max}^2 \leq \max_i \, (Y Q^{-1} Y^T - F_{t-1} Y_{t-1}^T)_{ii}, \qquad (38) $$

or as the LMI

$$ \begin{bmatrix} \Lambda_\beta + F_{t-1} Y_{t-1}^T & * \\ Y^T & Q \end{bmatrix} \geq 0, \qquad (39) $$

where $\Lambda_\beta = \mathrm{diag}\{ \beta_1^2, \ldots, \beta_m^2 \}$ is a diagonal matrix comprising squares of rate constraints
of $u_{1,t}, \ldots, u_{m,t}$ on its diagonal.

The rates of elements of the control vector $u_t$ are constrained if the above LMI is added to the
set of LMIs (30)-(32).

8. Performance index, evaluation of directional change 

Evaluation of control performance that is coupled with anti-windup compensation requires the
following indices to be introduced:

$$ J_1 = \frac{1}{N} \sum_{i=1}^{p} \sum_{t=1}^{N} | r_{i,t} - y_{i,t} |, \qquad (40) $$

$$ J_2 = \frac{1}{N} \sum_{i=1}^{p} \sum_{t=1}^{N} ( r_{i,t} - y_{i,t} )^2, \qquad (41) $$

$$ \varphi_1 = \frac{1}{N} \sum_{t=1}^{N} | \varphi(v_t) - \varphi(u_t) | \; [^\circ], \qquad (42) $$

$$ \varphi_2 = \frac{1}{N} \sum_{t=1}^{N} ( \varphi(v_t) - \varphi(u_t) )^2, \qquad (43) $$

where (40) corresponds to the mean absolute tracking error of p outputs, (42) is the mean absolute
direction change between the computed and constrained control vectors, and $\varphi$ denotes angle
measure. In the case of m = 3, $\varphi$ corresponds to absolute angle measure (in which case there
is no need to decide its direction).
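A minimal Python sketch of how indices of the kind (40) and (42) may be evaluated from logged signals is given below. It is an illustration under stated assumptions: the signals are stored as lists of per-sample vectors, and the direction change is measured as the absolute angle between the computed vector $v_t$ and the constrained vector $u_t$ (obtained from the normalised dot product), which is the convention mentioned above for m = 3 and is used here for any m. The function names are hypothetical.

```python
import math


def mean_abs_tracking_error(r, y):
    """Index of type (40): r[t][i] and y[t][i] are the i-th reference and
    output at sample t; the sum over outputs is averaged over N samples."""
    n = len(r)
    total = 0.0
    for r_t, y_t in zip(r, y):
        total += sum(abs(ri - yi) for ri, yi in zip(r_t, y_t))
    return total / n


def angle_between_deg(v, u):
    """Absolute angle (degrees) between two vectors of equal length."""
    dot = sum(a * b for a, b in zip(v, u))
    norm_v = math.sqrt(sum(a * a for a in v))
    norm_u = math.sqrt(sum(b * b for b in u))
    if norm_v == 0.0 or norm_u == 0.0:
        return 0.0  # direction undefined; treated as no change (assumption)
    cosine = max(-1.0, min(1.0, dot / (norm_v * norm_u)))
    return math.degrees(math.acos(cosine))


def mean_abs_direction_change(v_seq, u_seq):
    """Index of type (42): mean absolute angle between computed and
    constrained control vectors over all samples."""
    n = len(v_seq)
    return sum(angle_between_deg(v, u) for v, u in zip(v_seq, u_seq)) / n


# Tiny synthetic data set (two samples, two signals), for illustration only.
r = [[1.0, 1.0], [1.0, 1.0]]
y = [[0.5, 1.0], [1.0, 0.0]]
v_seq = [[1.0, 0.0], [1.0, 1.0]]
u_seq = [[0.0, 1.0], [2.0, 2.0]]

J1 = mean_abs_tracking_error(r, y)
phi1 = mean_abs_direction_change(v_seq, u_seq)
print("J1 =", J1, " phi1 =", phi1)
```

The squared counterparts (41) and (43) follow by replacing the absolute values with squares.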

9. Plant models for simulation studies 

The considered plants are taken with delay d = 1 and are cross-coupled (values of control
offset have been given for reference signals of amplitude 3, where in the case of P3 the last
amplitude of reference signal is zero):
• P1 (m = 2, p = 2)



A =
• P2 (m = 3, p = 2)

10. Simulation studies

The performed simulations present anti-windup compensation performance for three
different plants and hard constraints imposed on the components of the control vector.
Two systems are simulated and depicted in Figure 5, namely the system with
amplitude constraints only and the system with added rate constraints, clipping only a part of the rates
of change in order to enable a comparison. From Table 1 it is visible that for this case of a
system with an equal number of inputs and outputs, for which dynamic decoupling with a
time-varying state-feedback control law is thus possible, introducing additional constraints causes
performance indices to increase (i.e. there is more windup phenomenon in the system) and,
simultaneously, it causes more severe directional change.

This is because a one-step-ahead control law can neither foresee future changes of reference
signals, nor assure perfect decoupling. Since imposed constraints alter the direction of the
unconstrained control vector, directional change is necessary to alter the coupling in the
system, in order to achieve better control performance.

a) without rate constraints

          P1a                  P2a                  P3a
          i=1      i=2         i=1      i=2         i=1      i=2
J_i1      2.0997   5.0601      2.1408   5.3081      1.6947   3.4929
J_i2      2.2919   5.9127      2.4300   7.0549      2.2419   5.5571
J_i3      -        -           -        -           0.3613   0.2092
J_i       4.3916   10.9728     4.5708   12.3630     4.2979   9.2592

b) with rate constraints

          P1b                  P2b                  P3b
          i=1      i=2         i=1      i=2         i=1      i=2
J_i1      2.1103   5.1055      2.1386   5.3000      1.7133   3.5442
J_i2      2.3239   6.1269      2.4353   7.1140      2.2101   5.5040
J_i3      -        -           -        -           0.3548   0.1982
J_i       4.4342   11.2324     4.5739   12.4140     4.2782   9.2464

          P1a        P2a         P3a
φ_1       2.8383     11.7238     7.9377
φ_2       31.2958    347.2096    250.0595

          P1b        P2b         P3b
φ_1       3.5644     12.7296     8.8754
φ_2       52.9696    434.2473    310.9078

Table 1. Performance indices: a) without, b) with rate constraints

In the case of the plant with a greater number of control inputs than plant outputs (P2), Figure 6,
it is visible that in comparison with P1 the directional change is more severe, because
three components of the control vector change in time. There are more degrees of freedom
(ways in which the control vector may vary in time; to be exact, it varies in space instead of, as
previously, in the plane), thus it is necessary to alter its direction in order to improve
decoupling. Once again performance indices tend to increase with rate constraints added.
For the last plant considered, P3, the number of control inputs is
deficient in comparison with plant outputs, so the direction of the control vector corresponds to good
tracking performance and, having introduced rate constraints, it should not be altered
excessively. By comparison of directional change indices and performance indices for P3 it is
visible that by introducing rate constraints one can avoid bumps in control signals, which
leads to better tracking.



11. Concluding remarks 

Solving the windup phenomenon problem does not have to mean that the constrained control
vector is of the same direction as the computed control vector if cross-coupling is present in the
control system. The presented state-feedback control law, by avoiding generation of
infeasible control actions, compensates the windup phenomenon in an a priori manner, though as
can be seen, directional change is present in the system in order to assure better
decoupling.

On the other hand, avoiding directional change in control enables one to avoid the windup
phenomenon if and only if the plant is perfectly decoupled or is not coupled at all. The
latter, due to the constraints, is hardly ever met, and if the application of some control law
does not require preserving the direction of the control vector (which might correspond to, e.g.,
keeping proportions between control signals), one should allow directional change to take
place. This will improve both anti-windup compensation performance and plant
decoupling.

1. Introduction



The minimum phase systems have ultimately been redefined for LTI discrete-time systems,
at first SISO and later square MIMO ones, as those systems for which minimum variance
control (MVC) is asymptotically stable, or in other words those systems that have 'stable'
zeros, or finally as 'stably invertible' systems. The redefinition has soon been extended to
nonsquare discrete-time LTI MIMO systems (Latawiec, 1998; Latawiec et al., 2000) and
finally to nonsquare continuous-time systems (Hunek, 2003; Latawiec & Hunek, 2002),
giving rise to the definition of new 'multivariable' zeros, i.e. the so-called control zeros. Control
zeros are an intriguing extension of transmission zeros for nonsquare LTI MIMO systems.
Like for SISO and square MIMO systems, control zeros are related to the stabilizing
potential of MVC and, in the input-output modeling framework considered, are generated
by (poles of) a generalized inverse of the 'numerator' polynomial matrix B(.). Originally,
the unique, so-called T-inverse, being the minimum-norm right or least-squares left inverse
involving the regular (rather than conjugated) transpose of the polynomial matrix, was
employed in the specific case of full normal rank systems (Hunek, 2002; Latawiec, 2004). The
associated control zeros were later called by the authors 'control zeros type 1' (Latawiec,
2004; Latawiec et al., 2004), as opposed to an infinite number of 'control zeros type 2'
generated by the myriad of possible polynomial matrix inverses, even those called τ- and
σ-inverses, also involving the unique minimum-norm or least-squares inverses (Latawiec,
2004; Latawiec et al., 2005b). Transmission zeros, if any, are included in the set of control
zeros; still, we will discriminate between control zeros and transmission zeros. In the later,
new 'inverse-free' MVC design approach based on the extreme points and extreme
directions method (Hunek, 2007; Hunek & Latawiec, 2006), it was possible to design a pole-free
inverse of the polynomial matrix B(.) so that no control zeros appeared, except
when transmission zeros are present, in which case the extreme points and extreme
directions method does not hold. In the current important result of the authors, the Smith
factorization of the polynomial matrix B(.) can lead to its pole-free inverse when there are
no transmission zeros, provided that the applied inverse is just the T-inverse, an
intriguing result bringing us back to the origin of the introduction of control zeros. In
the case of any other inverse of the Smith-factorized B(.) we end up with control zeros.
The remainder of this paper is organized as follows. System representations are reviewed in
Section 2. Section 3 presents the problem of minimum variance control for discrete-time LTI
MIMO systems. Section 4 describes the new approach to MVC design, confirming
Davison's theory of minimum phase systems, and indicates the role of the control zeros in
robust MVC-related designs. A simple simulation example of Section 5 indicates favorable
properties of the new method in terms of its contribution to robust MVC design. New
results of the paper are summarized in the conclusions of Section 6.

2. System representations 

Consider an $n_u$-input $n_y$-output LTI discrete- or continuous-time system with the input
u(t) and the output y(t), described by a possibly rectangular transfer-function matrix
$G \in \mathbb{R}^{n_y \times n_u}(p)$ in the complex operator p, where p = z or p = s for discrete-time or
continuous-time systems, respectively. The transfer function matrix can be represented in
the matrix fraction description (MFD) form $G(p) = A^{-1}(p) B(p)$, where the left coprime
polynomial matrices $A \in \mathbb{R}^{n_y \times n_y}[p]$ and $B \in \mathbb{R}^{n_y \times n_u}[p]$ can be given in the form
$A(p) = p^n I + \ldots + a_n$ and $B(p) = p^m b_0 + \ldots + b_m$, respectively, where n and m are the orders of
the respective matrix polynomials. An alternative MFD form $G(p) = \tilde{B}(p) \tilde{A}^{-1}(p)$, involving
right coprime $\tilde{A} \in \mathbb{R}^{n_u \times n_u}[p]$ and $\tilde{B} \in \mathbb{R}^{n_y \times n_u}[p]$, is also tractable here but in a less
convenient way (Latawiec, 1998). Algorithms for calculation of the MFDs are known
(Rosenbrock, 1970; Wolowich, 1974) and software packages in the MATLAB's Polynomial
Toolbox® are available. Unless necessary, we will not discriminate between
$A(p^{-1}) = I + \ldots + a_n p^{-n}$ and $A(p) = p^n A(p^{-1})$, nor between $B(p^{-1}) = b_0 + \ldots + b_m p^{-m}$ and
$B(p) = p^m B(p^{-1})$. In the sequel, we will assume for clarity that B(p) is of full normal rank; a
more general case of B(p) being of non-full normal rank can be easily tractable (Latawiec,
1998). Let us finally concentrate on the case when the normal rank of B(p) is $n_y$ ('symmetrical'
considerations can be made for normal rank $n_u$). Now, for discrete-time systems we have
$A(z^{-1}) = z^{-n} A(z) = I + \ldots + a_n z^{-n}$ and $B(z^{-1}) = z^{-m} B(z) = b_0 + \ldots + b_m z^{-m}$, with
$G(z) = A^{-1}(z) B(z) = z^{-d} A^{-1}(z^{-1}) B(z^{-1})$, where d = n - m is the time delay of the system. The analyzed MFD
form can be directly obtained from the AR(I)X/AR(I)MAX model of a system
$A(q^{-1}) y(t) = q^{-d} B(q^{-1}) u(t) + [C(q^{-1}) / D(q^{-1})] v(t)$, where $q^{-1}$ is the backward shift operator, $y(t) \in \mathbb{R}^{n_y}$,
$u(t) \in \mathbb{R}^{n_u}$ and $v(t) \in \mathbb{R}^{n_y}$ are the output, input and uncorrelated zero-mean disturbance,
respectively, in (discrete) time t; A and B as well as A and $C \in \mathbb{R}^{n_y \times n_y}[z]$ are relatively
prime polynomial matrices, with $C(z^{-1}) = c_0 + \ldots + c_k z^{-k}$ and $k \leq n$, and the D polynomial in the
$z^{-1}$-domain is often equal to $1 - z^{-1}$ (or to the $n_y$-identity matrix in discrete-time MVC
considerations).

The familiar Smith-McMillan form $S_M(p)$ (Kaczorek, 1998) of $G(p)$ (as a special case of the MFD factorization (Desoer & Schulman, 1974)) is given by $G(p) = U(p)S_M(p)V(p)$, where $U \in R^{n_y \times n_y}[p]$ and $V \in R^{n_u \times n_u}[p]$ are unimodular and the pencil $S_M \in R^{n_y \times n_u}(p)$ is of the form

$$S_M(p) = \begin{bmatrix} M_{r \times r} & 0_{r \times (n_u - r)} \\ 0_{(n_y - r) \times r} & 0_{(n_y - r) \times (n_u - r)} \end{bmatrix} \quad (1)$$

with $M(p) = \mathrm{diag}(\varepsilon_1/\psi_1, \varepsilon_2/\psi_2, \ldots, \varepsilon_r/\psi_r)$, where $\varepsilon_i(p)$ and $\psi_i(p)$, $i = 1, \ldots, r$ (with $r$ being the normal rank of $G(p)$), are monic coprime polynomials such that $\varepsilon_i(p)$ divides $\varepsilon_{i+1}(p)$, $i = 1, \ldots, r-1$, and $\psi_i(p)$ divides $\psi_{i-1}(p)$, $i = 2, \ldots, r$. The particular Smith form is given by the appropriate pencil $S(p)$, with $M(p) = \mathrm{diag}(\varepsilon_1, \varepsilon_2, \ldots, \varepsilon_r)$ often associated with Smith zeros or transmission zeros. The polynomials $\varepsilon_i(p)$ are often called the invariant factors of $G(p)$ and their product $\varepsilon(p) = \prod_{i=1}^{r} \varepsilon_i(p)$ is sometimes referred to as the zero polynomial of $G(p)$.

Extend the discrete-time system input-output description to the form accounting for 
additive disturbances 

$$A(q^{-1})y(t) = q^{-d}B(q^{-1})u(t) + C(q^{-1})v(t) \quad (2)$$

where $u(t)$ and $y(t)$ are the input and output vectors, respectively, $v(t)$ is the zero-mean uncorrelated disturbance vector, $d$ is the time delay and $A(q^{-1})$, $B(q^{-1})$ and $C(q^{-1})$ are the appropriate matrix polynomials (in the backward shift operator $q^{-1}$) of orders $n$, $m$ and $k$, respectively. As usual, we assume that the leading coefficient of $A(q^{-1})$ is equal to the identity matrix. Assume that $A(q^{-1})$ and $B(q^{-1})$ as well as $A(q^{-1})$ and $C(q^{-1})$ are left coprime, with $B(q^{-1})$ and (stable) $C(q^{-1})$ being of full normal rank $n_y$. For generality and for duality with the continuous-time case, we use here the ARMAX model, even though it is well known that in control engineering practice the polynomial matrix $C(q^{-1})$ of disturbance parameters is unlikely to be effectively estimated (and is often used as a control design, observer polynomial matrix instead).

In the sequel, we proceed with discrete-time systems only but all the results are available for 
continuous-time systems as well (Hunek, 2003; Hunek & Latawiec, under review; Latawiec, 
2004; Latawiec & Hunek, 2002; Latawiec et al., 2004). 

3. Closed-loop minimum variance control 

Consider a right-invertible system described by equation (2) and assume that the observer (or disturbance-related) polynomial $C(q^{-1}) = c_0 + c_1 q^{-1} + \ldots + c_k q^{-k}$ has all roots inside the unit disk. (Note: Similar results can be obtained for left-invertible systems.)
Then the general MVC law, minimizing the performance index

$$\min_{u(t)} E\left\{ [y(t+d) - y_{ref}(t+d)]^T [y(t+d) - y_{ref}(t+d)] \right\} \quad (3)$$

where $y_{ref}(t+d)$ and $y(t+d) = \tilde{C}^{-1}(q^{-1})[\tilde{F}(q^{-1})B(q^{-1})u(t) + \tilde{H}(q^{-1})y(t)] + F(q^{-1})v(t)$ (Hunek, 2003; Hunek, 2007; Hunek & Latawiec, 2006; Hunek & Latawiec, under review; Latawiec, 2004) are the output reference/setpoint and the stochastic output predictor, respectively, is of the form

$$u(t) = B^R(q^{-1})\tilde{F}^{-1}(q^{-1})[\tilde{C}(q^{-1})y_{ref}(t+d) - \tilde{H}(q^{-1})y(t)] \quad (4)$$

The appropriate $n_y \times n_y$ polynomial matrices $\tilde{F}(q^{-1}) = \tilde{f}_0 + \tilde{f}_1 q^{-1} + \ldots + \tilde{f}_{d-1} q^{-d+1}$ and $\tilde{H}(q^{-1}) = \tilde{h}_0 + \tilde{h}_1 q^{-1} + \ldots + \tilde{h}_{n-1} q^{-n+1}$ are computed from the polynomial matrix identity (called the Diophantine equation)

$$\tilde{C}(q^{-1}) = \tilde{F}(q^{-1})A(q^{-1}) + q^{-d}\tilde{H}(q^{-1}) \quad (5)$$

with

$$\tilde{C}(q^{-1})F(q^{-1}) = \tilde{F}(q^{-1})C(q^{-1}) \quad (6)$$

where $F(q^{-1}) = I_{n_y} + f_1 q^{-1} + \ldots + f_{d-1} q^{-d+1}$, $\tilde{C}(q^{-1}) = \tilde{c}_0 + \tilde{c}_1 q^{-1} + \ldots + \tilde{c}_k q^{-k}$ and $I_{n_y}$ denotes the $n_y$-identity matrix.
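As a rough illustration of how identity (5) can be solved, the sketch below treats only the scalar (SISO) special case, in which the tilde and non-tilde polynomials coincide and (5) reduces to $C = FA + q^{-d}H$. The function name `solve_diophantine` and the numerical coefficients are assumptions made for this example and do not come from the chapter; the matrix case needs polynomial matrix arithmetic that is not shown here.

```python
from fractions import Fraction

def solve_diophantine(a, c, d):
    """Solve c = f*a + q^-d * h for scalar polynomials in q^-1.

    Polynomials are coefficient lists in ascending powers of q^-1,
    a[0] must equal 1 (monic A). Returns (f, h) with len(f) == d.
    """
    a = [Fraction(x) for x in a]
    rem = [Fraction(x) for x in c] + [Fraction(0)] * (d + len(a))
    f = []
    for i in range(d):                  # d steps of long division of C by A
        fi = rem[i] / a[0]
        f.append(fi)
        for j, aj in enumerate(a):
            rem[i + j] -= fi * aj
    h = rem[d:]                         # remainder, already shifted by q^-d
    while len(h) > 1 and h[-1] == 0:    # drop trailing zero coefficients
        h.pop()
    return f, h

# Hypothetical data: A = 1 - 1.5 q^-1 + 0.7 q^-2, C = 1 + 0.2 q^-1, d = 2
f, h = solve_diophantine(["1", "-1.5", "0.7"], ["1", "0.2"], 2)
print("F coefficients:", f)
print("H coefficients:", h)
```

In this scalar setting the d coefficients of F are simply the first d terms of the series expansion of C/A, which is why a truncated long division is sufficient.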

For right-invertible systems the symbol $B^R(q^{-1})$ denotes three possible classes of minimum-norm right $T$-, $\tau$- and $\sigma$-inverses of the polynomial matrix $B(q^{-1})$ (Hunek, 2003; Latawiec, 2004; Latawiec et al., 2004; Latawiec et al., 2005b; Latawiec et al., 2003). As with transmission zeros for SISO and square MIMO systems, poles of the right inverse $B^R(q^{-1})$ have been defined as control zeros (Hunek, 2003; Hunek, 2007; Hunek & Latawiec, 2006; Hunek & Latawiec, under review; Latawiec, 1998; Latawiec, 2004; Latawiec et al., 2000; Latawiec & Hunek, 2002; Latawiec et al., 2005a; Latawiec et al., 2004; Latawiec et al., 2005b; Latawiec et al., 2003). The minimum-norm right inverse was used in the unique $T$-inverse to generate a unique set of control zeros type 1 for right-invertible systems (Hunek, 2003; Latawiec, 2004; Latawiec & Hunek, 2002; Latawiec et al., 2003).

However, the formula (4) can be treated as a solver of the MVC-related matrix polynomial equation

$$B(q^{-1})u(t) = \bar{y}(t) \quad (7)$$

where $\bar{y}(t) = \tilde{F}^{-1}(q^{-1})[\tilde{C}(q^{-1})y_{ref}(t+d) - \tilde{H}(q^{-1})y(t)]$ (Latawiec, 2004). When analyzing possible solutions to equation (7) we have introduced new classes of inverses of polynomial matrices, that is a finite number of $\tau$-inverses and an infinite number of $\sigma$-inverses (Latawiec, 2004; Latawiec et al., 2004; Latawiec et al., 2005b), all surprisingly employing the unique minimum-norm right inverse. The $\tau$- and $\sigma$-inverses contribute to the generation of what has been referred to as control zeros type 2 (Latawiec, 2004; Latawiec et al., 2004; Latawiec et al., 2005b). It is interesting to note that transmission zeros make a subset of control zeros.

4. New approach to MVC design 

In an attempt to substantially reduce the computational burden of the extreme points and extreme directions method, we introduce yet another effective (and much simpler) approach to the MVC design, in which we make use of the equality (Hunek, 2007; Hunek & Latawiec, 2006)

$$B(q^{-1})B^R(q^{-1}) = I \quad (8)$$

Consider an LTI $n_u$-input $n_y$-output system described by the ARMAX model (2). Put $w = q^{-1}$ and factorize $B(w)$ to the Smith form $B(w) = U(w)S(w)V(w)$, where $U(w)$ and $V(w)$ are unimodular. Now, $B^R(w) = V^{-1}(w)S^R(w)U^{-1}(w)$, with the determinants of $U(w)$ and $V(w)$ being independent of $w$, so that possible instability of the inverse polynomial matrix $B^R(w)$ is related to $S^R(w)$ only. Remarkably, applying the minimum-norm right $T$-inverse $S^R(w) = S^T(w)[S(w)S^T(w)]^{-1}$ guarantees that no control zeros except transmission zeros appear in the inverse $S^R(w)$. (Employing any other inverses, e.g. $\tau$- or $\sigma$-inverses, causes control zeros to appear in $S^R(w)$ in addition to transmission zeros.) This intriguing result has been confirmed in a number of simulations but no formal proof exists so far. The result confirms the value of the Smith factorization on the one hand, and of the $T$-inverse on the other.

5. Simulation example 

Consider the three-input and two-output unstable system described by the noise-free deterministic part of model (2) with

$$B(q^{-1}) = \begin{bmatrix} 2 + 3.8q^{-1} & 2q^{-1} & 1 \\ 4q^{-1} & 1 + 1.9q^{-1} & 1 \end{bmatrix},$$



Ail' 1 )- 



■■ 500 + 570 ij + 19</ and d = 2. The control zeros type 1, obtained on a basis of T -inverse 



of $B(q^{-1})$, determine unstable MVC or perfect control of the system. Besides, it is very difficult to find stable MVC/perfect control on the basis of the $\tau$- and $\sigma$-inverses with the associated control zeros type 2. Since the system has one (stable) transmission zero at $z = 0.1$, it is impossible to employ the extreme points and extreme directions method. Therefore, we apply our new method of Section 4. Now, after substitution $w = q^{-1}$ and after Smith
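factorization we obtain $B(w) = U(w)S(w)V(w)$ with unimodular $U(w)$ and $V(w)$ and

$$S(w) = \begin{bmatrix} 1 & 0 & 0 \\ 0 & w - 10 & 0 \end{bmatrix},$$

and finally

$$u(t) = V^{-1}(q^{-1})\,S^R(q^{-1})\,U^{-1}(q^{-1})\,\tilde{F}^{-1}(q^{-1})\left[y_{ref}(t+d) - \tilde{H}(q^{-1})y(t)\right] \quad (9)$$

where the $T$-inverse $S^R(w) = S^T(w)[S(w)S^T(w)]^{-1}$ has the single pole $w = 10$,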



with specific forms of $\tilde{F}(q^{-1})$ and $\tilde{H}(q^{-1})$ not presented here due to their mathematical complexity. Now, for $y_{1,ref} = 1$, $y_{2,ref} = 1.5$ the outputs remain at the setpoint for $t > d = 2$ under the stabilizing perfect control, whose plots $u_1(t)$, $u_2(t)$ and $u_3(t)$ according to equation (9) are shown in Fig. 1. For clarity, we have chosen to show the performance of (noise-free) perfect control rather than MVC.
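The role of the Smith factorization in this example can be checked numerically with a small sketch. It assumes the entries of $B(w)$, $w = q^{-1}$, as given in the code comments (in particular the (1,2) entry $2w$ is an assumption of this sketch), and all helper names are hypothetical. It verifies that every 2x2 minor of $B(w)$ vanishes at $w = 10$ (the transmission zero $z = 0.1$), and that $\det[B(w)B^T(w)]$, the denominator of the $T$-inverse applied directly to $B(w)$, has further roots that map outside the unit disk in $z$.

```python
import cmath
from fractions import Fraction as Fr

def pmul(a, b):
    """Product of two polynomials (ascending coefficient lists)."""
    out = [Fr(0)] * (len(a) + len(b) - 1)
    for i, x in enumerate(a):
        for j, y in enumerate(b):
            out[i + j] += x * y
    return out

def padd(a, b, sign=1):
    """a + sign*b for polynomials of possibly different length."""
    n = max(len(a), len(b))
    a = a + [Fr(0)] * (n - len(a))
    b = b + [Fr(0)] * (n - len(b))
    return [x + sign * y for x, y in zip(a, b)]

def peval(p, x):
    """Horner evaluation of p at x."""
    acc = Fr(0)
    for c in reversed(p):
        acc = acc * x + c
    return acc

def deflate(p, root):
    """Divide p(w) by (w - root); return (quotient, remainder)."""
    desc = p[::-1]
    out = [desc[0]]
    for c in desc[1:]:
        out.append(c + root * out[-1])
    remainder = out.pop()
    return out[::-1], remainder

# Assumed B(w): [[2 + 3.8w, 2w, 1], [4w, 1 + 1.9w, 1]]
B = [[[Fr(2), Fr(19, 5)], [Fr(0), Fr(2)], [Fr(1)]],
     [[Fr(0), Fr(4)], [Fr(1), Fr(19, 10)], [Fr(1)]]]

def minor(i, j):
    """2x2 minor of B built from columns i and j."""
    return padd(pmul(B[0][i], B[1][j]), pmul(B[0][j], B[1][i]), sign=-1)

minors = [minor(0, 1), minor(0, 2), minor(1, 2)]
shared_zero = all(peval(m, Fr(10)) == 0 for m in minors)

# Cauchy-Binet: det(B B^T) equals the sum of the squared 2x2 minors
det_bbt = [Fr(0)]
for m in minors:
    det_bbt = padd(det_bbt, pmul(m, m))

# Remove the factor (w - 10)^2 and solve the remaining quadratic
q1, r1 = deflate(det_bbt, Fr(10))
q2, r2 = deflate(q1, Fr(10))
c0, c1, c2 = (float(x) for x in q2)
disc = cmath.sqrt(c1 * c1 - 4 * c2 * c0)
w_extra = [(-c1 + disc) / (2 * c2), (-c1 - disc) / (2 * c2)]
z_extra = [1 / wk for wk in w_extra]

print("all minors vanish at w = 10:", shared_zero)
print("moduli of the extra roots in z:", [abs(z) for z in z_extra])
```

Under these assumptions the extra roots lie outside the unit disk in $z$, which is consistent with the statement that the $T$-inverse of $B(q^{-1})$ itself leads to unstable control, whereas the $T$-inverse of the Smith form only involves the factor $w - 10$.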



Fig. 1. Perfect control plots for the specific example (control inputs $u_1(t)$, $u_2(t)$ and $u_3(t)$ versus time)

Remark. Although the Smith factorization approach undeniably contributes to the robust MVC design, in the majority of cases the application of control zeros can give much better results (Hunek, under review), giving rise to an extension of Davison's theory of minimum phase systems (Davison, 1983). Unfortunately, there exists no formal proof of the above statement and it has been left for future research.



6. Conclusions 

The Smith factorization approach to robust minimum variance control has been presented in this paper. The new method appears much better than the others designed by the authors, namely the one based on the extreme points and extreme directions method and a second one called minimum-energy. Firstly, it is computationally much simpler and secondly, it also works for the case when transmission zeros are (nongenerically) present in the nonsquare system. Strangely enough, the presented method should operate on the $T$-inverse exclusively, and any other inverse gives rise to the appearance of control zeros. What is also strange, applying the $T$-inverse directly to the polynomial matrix $B(\cdot)$ (rather than to its Smith-factorized form) inevitably ends up with control zeros. Finally, the new approach confirms Davison's theory and indicates the need for the introduction of a complementary control zeros theory.
1. Introduction



Semiconductor manufacturers prefer automatic machines for reasons of quality, productivity and effectiveness. Compared with other industries, the semiconductor industry has automated the individual steps of a process to a relatively high level. Most operators learn about wafer placement rather than the principle of the process. Manual systems, however, do not know the placement of a wafer, so operators have to set the initial conditions for the process. Wafer alignment is an operation for correcting the current wafer position in the system coordinate until the wafer is located at the target position. The wafer position varies after loading, so alignment steps are required.

Manual alignment systems need the operators' help every time a wafer is loaded, which is a time-consuming process that lowers manufacturing productivity and raises costs. Automatic alignment can save this time and cost. If the machines have an automatic alignment function, wafer processes can be connected automatically. Operators would then only have to check and monitor the processing situation, and fix a problem when it arises. Therefore, one operator can operate more machines, and would not be required to have high process skills. Fig. 1 shows the concept of wafer alignment in the dicing process.

Passive alignment relies on mechanical structures that resist external forces without any actuators. A self-constrained mechanism was developed by Choi et al. (1999): when an axial force is exerted on a stage, grooves beneath the stage generate internal stress which prevents the stage from moving. Pyramid and groove mechanisms generate a stacking force under external stress (Slocum & Weber, 2003).

Active alignment using sensors and actuators is widely applied in semiconductor manufacturing processes. Anderson's method (Anderson et al., 2004) is based on cross-correlation between a defined template and an inspected image; the pixels around a peak are segmented and interpolated at sub-pixel level. Misalignment can be detected by the Moiré effect and a laser beam. PZT actuators are applied to remove misalignment in lithography (Fan et al., 2006). Machine vision is commonly used to detect misalignment in wafer alignment (Hong & Fang, 2002).

We have developed an algorithm for wafer alignment. This alignment algorithm was derived from rigid body transformation or object transformation. The algorithm was based on the simultaneous motion of the x-y-θ axes in 2D space. A 2-step algorithm has a simple form which can be written with 2x2 matrix operations. The magnitude of misalignment was reduced on the basis of macro and micro inspection data (Kim et al., 2004). The matrix was expanded to 4x4 and included in the equation, so the x-y-θ alignment variables can be calculated within one equation (Kim et al., 2004; Kim et al., 2006). Manufacturing conditions, which deviate from the ideal conditions, affected the quality of alignment. Sometimes the misalignment did not become zero after macro-micro alignment alone. In this case, the problem can be solved by an iterative alignment equation, which is similar to numerical algorithms (Kim et al., 2007). The convergence speed of the iteration can be controlled by the convergence constant. This constant can prevent numerical oscillation. A convergence analysis method, similar to a numerical method, was proposed (Kim et al., 2006). We tried to obtain the exact solution in these studies, but found that the solutions can also be obtained from an estimated equation. The idea in this study came from that estimated method. The equation could be simplified by making a few assumptions about the wafer alignment condition. The derived formula does not contain the terms for the rotational center, which is impossible to measure exactly. The alignment results showed that the performance of the proposed algorithm was similar to that of the exact solution, and that the error convergence speed can be controlled.




Fig. 1. Concept of wafer alignment in dicing process (figure labels: wafer, cutting line; alignment => x-y-θ compensation)



2. Review of 2D alignment model 

2.1 Coordinate transformation for alignment (Kim et al., 2004)

The machine coordinate in the alignment system has three variables: x, y and θ. Let $P(x, y)$ be the original coordinate and $P'(x', y')$ be the transformed coordinate. Each coordinate can be written as follows

$$P = (x \;\; y \;\; \theta \;\; 1)^T, \quad P' = (x' \;\; y' \;\; \theta' \;\; 1)^T, \quad C = (C_x \;\; C_y \;\; 0 \;\; 0)^T \quad (1)$$
The alignment space is 2D, and the alignment procedure carries out translational and rotational motion. The movement can be described by a rigid body transformation, and the coordinate after the motion can be calculated simply by multiplying matrices. The translational matrix is usually denoted by 'T', the rotational matrix by 'R' and the center of rotation by 'C'. The transformation is then formulated by equation (2).

$$P' = T\{R(P - C) + C\} \quad (2)$$

The T and R matrices in the wafer alignment system can be derived as equations (3) and (4), which are 4x4 matrices.

$$T = \begin{bmatrix} 1 & 0 & 0 & \Delta x \\ 0 & 1 & 0 & \Delta y \\ 0 & 0 & 1 & 0 \\ 0 & 0 & 0 & 1 \end{bmatrix} \quad (3)$$

$$R = \begin{bmatrix} \cos\Delta\theta & -\sin\Delta\theta & 0 & 0 \\ \sin\Delta\theta & \cos\Delta\theta & 0 & 0 \\ 0 & 0 & 1 & \Delta\theta \\ 0 & 0 & 0 & 1 \end{bmatrix} \quad (4)$$
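A minimal sketch of transformation (2) with the 4x4 matrices (3) and (4) is given below. It assumes that the center is stored as $(C_x, C_y, 0, 0)$, so that the homogeneous 1 of $P$ survives the subtraction $P - C$; the function names and the sample numbers are hypothetical.

```python
import math

def mat_vec(M, v):
    """Multiply a 4x4 matrix (list of rows) by a 4-vector."""
    return [sum(m * x for m, x in zip(row, v)) for row in M]

def translation(dx, dy):
    """Matrix T of equation (3)."""
    return [[1, 0, 0, dx],
            [0, 1, 0, dy],
            [0, 0, 1, 0],
            [0, 0, 0, 1]]

def rotation(dtheta):
    """Matrix R of equation (4); the entry dtheta updates the angle."""
    c, s = math.cos(dtheta), math.sin(dtheta)
    return [[c, -s, 0, 0],
            [s, c, 0, 0],
            [0, 0, 1, dtheta],
            [0, 0, 0, 1]]

def transform(P, C, dx, dy, dtheta):
    """P' = T{R(P - C) + C} with P = (x, y, theta, 1), C = (Cx, Cy, 0, 0)."""
    shifted = [p - c for p, c in zip(P, C)]
    rotated = mat_vec(rotation(dtheta), shifted)
    recentred = [r + c for r, c in zip(rotated, C)]
    return mat_vec(translation(dx, dy), recentred)

# Hypothetical numbers: rotate 90 degrees about (1, 1), then shift
P = [2.0, 1.0, 0.0, 1.0]
C = [1.0, 1.0, 0.0, 0.0]
P_new = transform(P, C, dx=0.5, dy=-0.25, dtheta=math.pi / 2)
print(P_new)
```

The point (2, 1) lies one unit to the right of the center, so after a quarter turn it sits one unit above the center before the translation is added.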



2.2 Basic alignment algorithm (Kim et al., 2006)

The wafer has marks for alignment. The ideal mark position $P_t$ is stored when the wafer is perfectly aligned. Misalignment is calculated from the current position $P_c$. The vision system inspects the location of the mark on the screen. These mark positions can be defined as follows.

$$P_t = (x_t \;\; y_t \;\; \theta_t \;\; 1)^T, \quad P_c = (x_c \;\; y_c \;\; \theta_c \;\; 1)^T \quad (5)$$

When the current position of the mark deviates from the ideal one, the resulting displacement is defined as $\Delta$. The mark position in the machine can be obtained from the target position and the displacement by equation (6).

$$P_c = P_t + \Delta = (x_t + \Delta x, \; y_t + \Delta y, \; \theta_t + \Delta\theta, \; 1)^T \quad (6)$$

If the current position is compensated with a suitable value $\alpha = (\alpha_x, \alpha_y, \alpha_\theta, 0)$, the mark will be located at the target position. So, an alignment algorithm $f$ can be written as equation (7).

$$f(P_c, \alpha) - P_t = 0 \quad (7)$$

$f(P_c, \alpha)$ can be replaced with the equation from the rigid body transformation. The result is shown as (8). T and R contain the unknown compensation variables for the current position.
$$T\{R(P_c - C) + C\} - P_t = 0 \quad (8)$$

The positive directions of the mathematical coordinate system and the vision system can be opposite; the relation is described by the vision direction matrix $D_v$, whose diagonal terms are either +1 or -1 and whose other terms are zero.

$$T\{R(P_t + D_v\Delta - C) + C\} - P_t = 0 \quad (9)$$

A similar direction problem can occur between the mathematical coordinate system and the machine; the machine direction matrix $D_m$ has the same characteristics as $D_v$.

$$\beta = D_m\alpha = (\beta_x \;\; \beta_y \;\; \beta_\theta \;\; 1)^T \quad (10)$$

The unknown $\alpha$ can be calculated from these equations; (11) and (12) are the exact solution for the case in which two points ($n = 1, 2$) are inspected to align a line. Here $\theta_n$ denotes the rotation angle of the compensation, that is, $\alpha_\theta$.

$$\alpha_{xn} = x_{tn} - C_x - (x_{cn} - C_x)\cos\theta_n + (y_{cn} - C_y)\sin\theta_n$$
$$\alpha_{yn} = y_{tn} - C_y - (x_{cn} - C_x)\sin\theta_n - (y_{cn} - C_y)\cos\theta_n \quad (11)$$

$$\tan\alpha_\theta = \frac{(x_{c1} - x_{c2})(y_{t1} - y_{t2}) - (y_{c1} - y_{c2})(x_{t1} - x_{t2})}{(x_{t1} - x_{t2})(x_{c1} - x_{c2}) + (y_{t1} - y_{t2})(y_{c1} - y_{c2})} \quad (12)$$
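The exact solution (11)-(12) can be exercised with a short sketch. It assumes that the direction matrices are identities, uses `math.atan2` in place of a plain arctangent of the ratio in (12) for numerical robustness, and builds hypothetical test data by rotating two target marks about a known center and shifting them; all names and numbers are illustrative.

```python
import math

def exact_compensation(pt1, pt2, pc1, pc2, center):
    """Angle from (12), then (alpha_x, alpha_y) from (11) for both marks."""
    num = (pc1[0] - pc2[0]) * (pt1[1] - pt2[1]) \
        - (pc1[1] - pc2[1]) * (pt1[0] - pt2[0])
    den = (pt1[0] - pt2[0]) * (pc1[0] - pc2[0]) \
        + (pt1[1] - pt2[1]) * (pc1[1] - pc2[1])
    theta = math.atan2(num, den)
    cx, cy = center
    c, s = math.cos(theta), math.sin(theta)
    alphas = []
    for (xt, yt), (xc, yc) in ((pt1, pc1), (pt2, pc2)):
        ax = xt - cx - (xc - cx) * c + (yc - cy) * s
        ay = yt - cy - (xc - cx) * s - (yc - cy) * c
        alphas.append((ax, ay))
    return theta, alphas

def rotate_about(p, center, angle):
    """Rotate point p about center by angle (radians)."""
    c, s = math.cos(angle), math.sin(angle)
    dx, dy = p[0] - center[0], p[1] - center[1]
    return (center[0] + c * dx - s * dy, center[1] + s * dx + c * dy)

# Hypothetical setup: two targets, known center, 1.5 degree misalignment
targets = [(-50.0, 0.0), (50.0, 0.0)]
center = (3.0, -2.0)
phi = math.radians(1.5)
current = [rotate_about(t, center, phi) for t in targets]
current = [(x + 0.4, y - 0.3) for x, y in current]

theta, alphas = exact_compensation(targets[0], targets[1],
                                   current[0], current[1], center)

# Apply the compensation: rotate about the center, then translate
corrected = []
for pc, (ax, ay) in zip(current, alphas):
    x, y = rotate_about(pc, center, theta)
    corrected.append((x + ax, y + ay))
print(theta, alphas, corrected)
```

With a known center, both marks yield the same translation and the corrected marks coincide with the targets, which is what "exact solution" means here.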

3. Centerless model 

3.1 Simplification 

Equations (11) and (12) are the exact solution, but estimated solutions have been used for many numerical problems. Some variables can be eliminated by geometric relations and alignment conditions. Fig. 2 shows the general condition for wafer alignment in the dicing process. First, the angular misalignment in the process is within ±2°, which means $\sin\alpha_\theta \approx \alpha_\theta$ and $\cos\alpha_\theta \approx 1$. Equation (11) can then be written as follows

$$\alpha_{xn} = x_{tn} - x_{cn} + (y_{cn} - C_y)\sin\alpha_{\theta n} = -\Delta_{xn} + \alpha_{\theta n}(y_{cn} - C_y)$$
$$\alpha_{yn} = y_{tn} - y_{cn} - (x_{cn} - C_x)\sin\alpha_{\theta n} = -\Delta_{yn} - (x_{cn} - C_x)\alpha_{\theta n} \quad (13)$$

Second, inspection is carried out at two points, and the compensation value is the average, $\alpha = (\alpha_1 + \alpha_2)/2$. The inspection points are axis-symmetric about the rotational center, $2C_x - (x_{c1} + x_{c2}) = 0$. Another assumption is that the mark position is defined near the rotational center, $2C_y - (y_{c1} + y_{c2}) = 0$. Therefore, equation (13) can be expressed as (14).

$$\alpha_x = -(\Delta_{x1} + \Delta_{x2})/2$$
$$\alpha_y = -(\Delta_{y1} + \Delta_{y2})/2 \quad (14)$$

The x-stroke between the two points is practically constant, $l = x_{t2} - x_{t1} \approx x_{c2} - x_{c1}$, because the x variation caused by misalignment is much smaller than the moving stroke. The y-stroke between the two points is practically zero, $y_{t2} - y_{t1} = 0$. Equation (15) is derived from these relations.
Fig. 2. General conditions for wafer alignment

$$\tan\alpha_\theta = \frac{\Delta_{y1} - \Delta_{y2}}{l} \quad (15)$$

The derived equations are much simpler and have no center terms for rotation. Considering the wafer alignment system, there are three centers, as shown in Fig. 3: the rotational center, the wafer center, and the chuck center. The positions of these three centers are different, and they can neither be measured exactly nor fixed at the same position. The centerless algorithm gives an estimated solution rather than an exact one, so the iteration step is necessary.
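The centerless formulas can be written as a few lines of code. The sketch below assumes that the numerator of (15) is $\Delta_{y1} - \Delta_{y2}$, which is what (12) reduces to under the stated stroke assumptions, and that the direction matrices are identities; the function name and the test geometry (a pure rotation about the midpoint of the two marks) are hypothetical.

```python
import math

def centerless_compensation(d1, d2, stroke):
    """Estimated compensation from the deviations at two inspection points.

    d1, d2 : measured deviations (dx, dy) of mark 1 and mark 2
    stroke : x distance l between the two inspection points
    """
    alpha_x = -(d1[0] + d2[0]) / 2.0                   # equation (14)
    alpha_y = -(d1[1] + d2[1]) / 2.0                   # equation (14)
    alpha_theta = math.atan((d1[1] - d2[1]) / stroke)  # equation (15)
    return alpha_x, alpha_y, alpha_theta

# Hypothetical check: marks at (-l/2, 0) and (l/2, 0) rotated by 1 degree
l = 100.0
phi = math.radians(1.0)
d1 = (-l / 2 * math.cos(phi) + l / 2, -l / 2 * math.sin(phi))
d2 = (l / 2 * math.cos(phi) - l / 2, l / 2 * math.sin(phi))
ax, ay, at = centerless_compensation(d1, d2, l)
print(ax, ay, math.degrees(at))
```

No center coordinate enters the computation; the price is a small residual error (here of the order of the cube of the angle), which the iteration of the next subsection removes.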




Fig. 3. Different Centers on alignment table 
Fig. 4. Concept of movement for the points in alignment (figure labels: initial misalignment; inspection position 1; middle point of inspection points; inspection position 2)

The equation can be explained in another way. Fig. 4 shows the virtual alignment procedure. The inspection positions are located at both ends of the line, and the middle point can represent the inspection points. The misalignment of the middle point can be estimated from the average of the misalignments at the two points. The compensation value of the middle point is approximately equal to the magnitude of the misalignment, and the result can be written as (14) and (15).

3.2 Iteration and convergence 

The alignment properties are estimated, so the misalignment may not be zero after the first alignment. Manufacturing conditions can also vary and differ from the ideal condition. The alternative is iterative alignment, which is carried out until the misalignment falls below the tolerance level. Compensation values are added to the current position, and the i-th step of the alignment can be written as equation (16). It is necessary to control the convergence speed, and the convergence constant is defined as $\eta$.

$${}^{i}P_c = {}^{i-1}P_c + \eta \times {}^{i}\beta \quad (16)$$

Misalignment can be defined as the magnitude of the deviation, as shown in (17). The iteration is terminated when the misalignment becomes smaller than the tolerance $\varepsilon$.

$$\sqrt{(\Delta_{x1}^2 + \Delta_{x2}^2) + (\Delta_{y1}^2 + \Delta_{y2}^2)} < \varepsilon \quad (17)$$

As the automatic alignment proceeds, the numerical position on the machine is compensated. The magnitude of the compensation value $\beta$ decreases with the number of iterations. $\gamma$ is defined as the total amount of compensation, that is, the difference between the initial position and the current position. $\gamma$ converges to a certain value if the alignment is done well.

$${}^{i}\gamma = {}^{i-1}\gamma + \eta \times {}^{i}\beta = {}^{i}P_c - {}^{0}P_c \quad (18)$$
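The iteration can be illustrated with a small simulation. The sketch below is only a model under stated assumptions: the wafer marks are rotated about a center that the algorithm never uses, the compensation is computed from the averaged deviations and the stroke as in the centerless formulas, the misalignment is taken as the root of the summed squared deviations, and the inspection points are kept fixed. The function `align` and all numbers are hypothetical.

```python
import math

def rotate_about(p, center, angle):
    """Rotate point p about center by angle (radians)."""
    c, s = math.cos(angle), math.sin(angle)
    dx, dy = p[0] - center[0], p[1] - center[1]
    return (center[0] + c * dx - s * dy, center[1] + s * dx + c * dy)

def align(eta, tol=1e-3, max_iter=100):
    """Iterate the compensation until the misalignment is below tol.

    Returns (number of compensation steps, final misalignment, gamma),
    where gamma accumulates the applied compensation (x, y, theta).
    """
    stroke = 100.0
    targets = [(-stroke / 2, 0.0), (stroke / 2, 0.0)]
    true_center = (3.0, -2.0)          # used by the machine model only
    marks = [rotate_about(t, true_center, math.radians(1.5)) for t in targets]
    marks = [(x + 0.4, y - 0.3) for x, y in marks]
    gamma = [0.0, 0.0, 0.0]
    for i in range(max_iter):
        dev = [(m[0] - t[0], m[1] - t[1]) for m, t in zip(marks, targets)]
        mis = math.sqrt(sum(dx * dx + dy * dy for dx, dy in dev))
        if mis < tol:
            return i, mis, gamma
        ax = -(dev[0][0] + dev[1][0]) / 2.0
        ay = -(dev[0][1] + dev[1][1]) / 2.0
        at = math.atan((dev[0][1] - dev[1][1]) / stroke)
        step = (eta * ax, eta * ay, eta * at)   # scaled by eta
        marks = [rotate_about(m, true_center, step[2]) for m in marks]
        marks = [(x + step[0], y + step[1]) for x, y in marks]
        gamma = [g + s for g, s in zip(gamma, step)]
    raise RuntimeError("alignment did not converge")

results = {eta: align(eta) for eta in (0.5, 1.0, 1.5)}
for eta, (steps, mis, gamma) in results.items():
    print(eta, steps, mis, gamma)
```

In this model a constant below 1.0 approaches the target monotonically, a constant above 1.0 overshoots and oscillates, and both need more steps than a constant of 1.0.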




Fig. 5. Photo of automatic wafer dicing machine 



4. Experiment 

4.1 Dicing machine 

The algorithm was tested on the wafer dicing machine. The machine had 4 axes: x, y, z and θ. Because the z axis is used only for the cutting action, it was excluded from alignment. Each axis is composed of a ball-screw and an LM guide. The axes were accurate to the micrometer level. Fig. 5 shows a picture of the dicing machine, and Table 1 shows its specifications. The align mark was inspected with a PCI frame grabber, which has 0.1 sub-pixel accuracy. Two cameras were connected to the frame grabber. Very small lenses were attached to the cameras, which had the same magnification because of the align patterns.
Contents         Name/axis    Specification
resolution       x            1 µm
                 y            0.2 µm
                 θ            2.0 x 10^-4 deg
linear scale     y            10000 ps/mm
pitch            x            10 mm/rev
                 y            4 mm/rev
                 θ            1/100 (harmonic driver)
motor            x            10000 ps
                 y            20000 ps
                 θ            18000 ps
backlash         x            7 µm
                 y            0 µm
camera           model        Sony XC-HR50
                 pixel        640 x 480
                 FOV          1.31 mm x 0.98 mm
                 exposure     20 ms
frame grabber    model        Cog8501
                 precision    0.1 sub-pixel
                 library      Vision Pro 3.5

Table 1. Specification of Machine



4.2 Procedure 

First, a wafer was aligned manually to get a standard pattern on an inspected image. The align pattern in the experiment was a cross mark. The machine was moved to search for the best pattern with a high-quality image. The image of the mark and the training results were stored. The inspection positions were defined as the mechanical positions at which the align pattern was located at the center of the screen. The inspection positions were defined for dicing lines of θ = 0° and θ = 90°.

After the position definition, the wafer was released from the table and placed again manually to give random errors. The system began with the 0° alignment, and then proceeded to the 90° alignment. The magnifications of the lenses were the same, so macro-alignment was not meaningful. Three lines were aligned for 0°, and ten for 90°. The deviations were stored at each inspection point. The compensation values were obtained from the deviations. The system axes were compensated by these values, so the inspection positions also changed. The inspection and compensation were iterated until the misalignment was under 1.0 pixel. The convergence constant η was varied from 0.5 to 1.5 without releasing the wafer. The experiment was repeated for 15 cases of mark locations on the vision screen, as shown in the previous report.



The Wafer Alignment Algorithm Regardless of Rotational Center 






5. Result 

Wafers were aligned successfully in all 15 cases. One of the cases is shown in this paper. The variation of the γ's is shown in Fig. 6 - Fig. 11. The horizontal axis of the plots is the number of iterations, and the vertical axis is the compensation value. The thick black line, called 'org', is the result obtained with the original equation before the simplification. The convergence path varied with η, so the convergence speed can be controlled. When η increased, the path showed a bigger overshoot and sharp edges. This trend always occurred when the value was above 1.0, and it also makes the alignment time longer. When η decreased, the path showed a smooth shape and a longer alignment time. This trend always occurred when η was below 1.0, and there was no overshoot in this case. The original equation provided the fastest convergence for a given case, but the result showed that the simplified equation was also effective in application. The best value of the convergence constant was 1.1 for γ_x, but 1.0 for γ_y. The response of the best case was delayed by about 1.0 step compared with the original equation.



6. Conclusion 

The proposed equation for wafer alignment was derived from object transformation and simplified under some assumptions. The alignment algorithm had iteration terms with a convergence constant. The algorithm was applied to a dicing machine. After setup, the wafers were placed manually and aligned with the proposed algorithm. The iteration was terminated when the measured misalignment fell below 1.0 vision pixel. The convergence constant η was varied from 0.5 to 1.5 for each case. The alignment was completed in all 15 cases. The result showed that the simplified algorithm converged more slowly than the original equation, but the delay was only about 1.0 step, depending on the convergence constant. When η was above 1.0, the response curve had an overshoot and sharp edges, whereas the curve was smooth and the response slower when η was below 1.0. The alignment algorithm is simple enough to be applied to PLC-based systems.
Fig. 6. Variation of γ_x by iteration (η = 1.1-1.5)
Fig. 7.
Fig. 8. Variation of γ_y by iteration (η = 1.1-1.5)
Fig. 9. Variation of γ_y by iteration (η = 0.5-1.0)